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DRUG TARGET ISOGENES: 
POLYMORPfflSMS IN THE OSTEOCLASTOGENESIS INHIBITORY FACTOR GENE 



RELATED APPLICATIONS 
S This application is a continuation-in-part of, and claims priority to, U.S. Provisional Application 

Serial No. 60/143,020 filed July 9. 1999. 

FIELD OF THE INVENTION 

This invention relates to variation in genes that encode pharmaceutically important proteins. In 
10 particular, this invention provides genetic variants of the human Osteoclastogenesis Inhibitory Factor 
(TNFRSFl IB) gene and methods for identifying which variant(s) of this gene is/are possessed by an 
individual. 

BACKGROUND OF THE INVENTION 

15 Current methods for identifying pharmaceuticals to treat disease often start by identifying, 

cloning, and expressing an important target protein related to the disease. A determination of whether an 
agonist or antagonist is needed to produce an effect that may benefit a patient with the disease is then 
made. Then, vast numbers of compounds are screened against the target protein to find new potential 
drugs. The desired outcome of this process is a drug that is specific for the target, thereby reducing the 

20 incidence of the undesired side effects usually caused by a compound's activity at non-intended targets. 

What this approach fails to consider, however, is that natural variability exists in any and every 
population with respect to a particular protein. A target protein currently used to screen drugs typically is 
expressed by a gene cloned fi^m an individual who was arbitrarily selected. However, the nucleotide 
sequence of a particular gene may vary tremendously among individuals. Subtle alteration(s) in the 

25 primary nucleotide sequence of a gene encoding a target protein may be manifested as significant 

variation in expression of or in the structure and/or function of the protein. Such alterations may explain 
the relatively high degree of uncertainty inherent in treatment of individuals with drugs whose design is 
based upon a single representative example of the target. For example, it is well-established that some 
classes of drugs fi-equently have lower efficacy in some individuals than others, which means such 

30 individuals and their physicians must weigh the possible benefit of a larger dosage against a greater risk 
of side effects. In addition, variable information on the biological function or effects of a particular 
protein may be due to different scientists unknowingly studying different isoforms of the gene encoding 
the protein. Thus, information on the type and frequency of genomic variation that exists for 
pharmaceutically important proteins would be useful. 

35 The organization of single nucleotide variations (polymorphisms) in the primary sequence of a 

gene into one of the limited number of combinations that exist as units of inheritance is termed a 
haplotype. Each haplotype dierefore contains significantly more information than individual unorganized 
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polymorphisms. Haplotypes provide an accurate measurement of the genomic variation in the two 
chromosomes of an individual. 

It is well-established that many diseases are associated with specific variations in gene sequences. 
However while there are examples in which individual polymorphisms act as genetic markers for a 
S particular phenotype, in other cases an individual polymorphism may be found in a variety of genomic 
backgrounds and therefore shows no definitive coupling between ihe polymorphism and the causative site 
for the phenotype (Clark AG et al. 1998 Am J Hum Genet 63:595-6 12; Ulbrecht M et al. 2000 Am J 
Respir Crit Care Med 161: 469-74). In addition, the marker may be predictive in some populations, but 
not in other populations (Clark AG et al. 1998 supra). In these instances, a haplotype will provide a 

10 superior genetic marker for the phenotype (Clark AG et al. 1 998 supra; Ulbrecht M et al. 2000, supra; 
Ruafio G & Stephens JC Gen Eng News 19 (21), December 1999). 

Analysis of the association between each observed haplotype and a particular phenotype permits 
ranking of each haplotype by its statistical power of prediction for the phenotype. Haplotypes found to be 
strongly associated with the phenotype can then have that positive association confirmed by alternative 

15 methods to minimize £alse positives. For a gene suspected to be associated with a particular phenotype, if 
no observed haplotypes for that gene show association with the phenotype of interest, then it may be 
inferred that variation in the gene has little, if any, involvement with that phenotype (Ruano & Stephens 
1999, supra). Thus, information on the observed haplotypes and their frequency of occurrence in various 
population groups will be useful in a variety of research and clinical applications. 

20 One possible drug target for the treatment of osteoporosis and other disorders caused by 

abnormal osteoclast recruitment and function is the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) 
gene or its encoded product. TNFRSFl IB is also known as Osteoprotegerin, (OPG) (Simonet et al., 90 
Cell 89(2):309-3l9, 1997) and TNF receptor-lite molecule 1 (TRl) (Takahashi et al., Endocr. Rev. 
20:345-357, 1999), and is also sometimes referred to by the symbols OCIF or OIF. TNFRSFl IB is a 

25 soluble member of the tumor necrosis factor receptor (TNFR) superfMnily and binds to at least one TNF- 
rclated cytokine, RANKL (also known as TRANCE, OFGL, ODF), which stimulates differentiation of 
osteoclasts, which are the bone resorbing cells in the body, from osteoclast precursors by binding to 
RANK, a TNF receptor family member (Tsuda E. & Higashio K 1 998 Nippon Rinsho 56: 1435-9). In 
vitro studies have shown that TNFRSFl IB neutralizes RANKL-induced osteoclastogenesis by binding to 

30 RANKL, suggesting that TNFRSFl IB is actually a secreted "decoy" receptor for RANKL that blocks 
initiation of a critical RANK-RANKL signal transduction pathway within osteoclast precursor cells 
(Takahshi et al., Biochem. Biophys. Res. Commun. 256:449-455, 1999). As a result of this blocking 
action, the number of mature osteoclasts is decreased. In vivo, TNFRSFl IB increases bone mineral 
density and bone volume in normal rats (Nippon Rinsho 56: 1435-1439, 1998), and also exhibits 

35 hypocalcemic effects in normal mice and in hypercalcemic nude mice carrying tumors associated with 
humoral hypercalcemia of malignancy (Bone 23:495-498, 1998). Also, it was reported that TNFRSFl IB 
knock-out mice develop severe osteoporosis due to enhanced osteoclastogenesis when they grew to be 
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adults (Mizuno et al., Biochem. Biophys. Res. Commun. 247:610-615, 1998). Thus, along with RANKL 
and RANK, TT'iJFRSFl IB is one of the key molecules that regulate osteoclast recruitment and function, 
and as such, an understanding of variation in the TNFRSFl IB gene should be useful in developing new 
therapies for metabolic diseases caused by abnormal osteoclast recruitment and function such as 
S osteopetrosis, osteoporosis, metastatic bone disease, Paget's disease, rheumatoid arthritis, and periodontal 
bone disease. 

The human, mouse and rat TNFRSFl IB proteins are all 401 amino acids in length, with human 
and rat TNFRSFl IB having 94% amino acid (a.a.) sequence identity (Akatsu et al. 1998 Bone 23: 495-8). 
Within the TNFR superfamily, TNFRSFl IB is most similar to TNFRII and CD40, in that this secreted 
10 protein has no transmembrane segment, and circulates as a disulfide-linked homodimer. TNFRSFl IB 
has four cysteine-rich domains and two death domain homologous regions present in tandem at the C- 
terminal portion of the protein (Morinaga et al., Eur. J. Biochem. 254:6850691, 1998; Mizuno et al., Gene 
214:339-343, 1998). 

Human TNFRSFl IB is encoded by a single-copy gene having five exons and four introns which 

1 S span 29 kilobases (kb) on chromosome 8q24 of the human genome (Morinaga et al., Eur. J. Biochem. 
254:685-691, 1998). Although the full genomic sequence has not been published, a reference sequence 
for this gene comprises the partial sequences shown in Fig. 1 (GenBank Accession No. AB008821.1; 
SEQ ID NO: 1 ), which includes the 5 ' untranslated region and the coding sequence for exon 1 , and Fig. 2 
(GenBank Accession No. AB008822.1; SEQ ID N0:2), which includes the coding sequences for exons 2- 

20 5 as well as the 3 ' untranslated region. Reference sequences for an TNFRSFl IB cDNA and protein are 
shown in Figures 3 (SEQ ID N0:3) and 4 (SEQ ID N0:4), respectively. 

Expression regulatory elements identified in this nearly complete genomic sequence include (1) a 
major transcription initiation site located 67 nucleotides (nt) upstream of the initiation ATG codon, (2) a 
translation-termination codon in exon 5, and (3) a typical poly(A)-addition signal located 173 nt 

25 downstream of the translation-termination codon (Morinaga et al., supra). Two TNFRSFl IB transcripts 
of 4.2 kb and 6.5 kb have been detected in IMR-90 cells, with the shorter transcript containing the 3' half 
of intron 2 and the longer transcript containing all of intron 2 (Morinaga et al., supra). 

One group has reported finding single nucleotide polymorphisms (SNPs) at positions 9 and 22 of 
exon 1, which correspond to positions 1 181 and 1 194 of GENBANK Accession #ABO08821.1 (Yasuda et 

30 al.. Endocrinology 139:1329-1337, 1997; GENBANK Acc. No. E15271.1). The SNPs at these sites 
would result in variation in die encoded amino acid sequence at position 3 (lysine or asparagine) and/or 
position 8 (alanine or sraine), respectively, depending on the particular combination of nucleotides found 
at these polymorphic sites in one of the two copies of the TNFRSFl IB gene from an individual. 

Because of the potential for polymorphisms in the TNFRSFl IB gene to affect the expression and 

35 function of the encoded protein, it would be useful to determine whether additional polymorphisms exist 
in the TNFRSFl IB gene, as well as how such polymorphisms are combined in different copies of the 
gene. Such information would be useful for studying the biological fimction of TNFRSFl IB as well as in 
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identifyiag drugs targeting this protein for the treatment of disorders related to its abnormal expression or 
iiinction. 



SUMMARY OF THE INVENTION i 
5 Accordingly, the inventors herein have discovered 24 novel polymorphic sites in the 

TNFRSFl IB gene. These polymorphic sites (PS) correspond to the following nucleotide positions in the 
indicated GenBank Accession Number: 491 (PSl), 676 (PS2), 889 (PS3), 916 (PS4). 950 (PS5). 1217 
(PS7), 1294 (PS8) and 1390 (PS9) in AB008821; 505 (PS 10). 668 (PSl I), 4397 (PS12),4501 (PS13), 
6601 (PS14), 6824 (PS15), 6839 (PS16), 6845 (PS17). 6893 (PS18), 6950 (PS19), 8258 (PS20), 8391 

1 0 (PS2 1), 8622 (PS22), 8694 (PS23), 8955 (PS24) and 9003 (PS25) in AB008822. The polymorphisms at 
these sites are guanine or adenine at PS 1 , guanine or diymine at PS2, cytosine or thymine at PS3, guanine 
or thymine at PS4, thymine or cytosine at PSS, cytosine or thymine at PS7, guanine or adenine at PS8, 
cytosine or adenine at PS9, cytosine or thymine at PS 10, thymine or cytosine at PSl 1, thymine or 
cytosine at PS12, cytosine or thymine at PS13, guanine or adenine at PS14, cytosine or thymine at PSl 5, 

1 5 guanine or adenine in PSl 6, adenine or guanine at PS17, adenine or guanine at PSl 8, adenine or cytosine 
at PSl 9, guanine or adenine at PS20, thymine or cytosine at PS21. guanine or thymine at PS22, guanine 
or adenine at PS23, adenine or thymine at PS24, and thymine or cytosine at PS25. In addition, the 
inventors confirmed the presence of the previously reported polymorphic site at nucleotide 1181 (PS6) in 
Figure 1. The polymorphic site at 1 194 was not detected in the experiments described herein. It is 

20 believed that TNFRSFl IB-encoding polynucleotides containing one or more of the novel polymorphic 
sites reported herein will be useful in studying the expression and biological function of TNFRSFl IB, as 
well as in developing drugs targeting this protein. In addition, information on the combinations of 
polymorphisms in the TNFRSFl IB gene may have diagnostic and forensic applications. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 

25 nucleotide sequence which is a polymorphic variant of a reference sequence for the TNFRSFl IB gene or 
a fragment thereof. The reference sequence comprises SEQ ID NOS: 1-2 and the polymorphic variant 
comprises at least one polymorphism selected fi-om the group consisting of adenine at PSl, thymine at 
PS2. thymine at PS3, thymine at PS4, cytosine at PSS, thymine at PS7, adenine at PSS, adenine at PS9, 
thymine at PSIO, cytosine at PSl 1, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15, 

30 adenine at PS16, guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, cytosine at PS21, 
thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS2S. In a preferred embodiment, the 
polymorphic variant conqirises an additional polymorphism of cytosine at PS6. A particularly preferred 
polymorphic variant is a naturally-occurring isoform (also referred to herein as an "isogene") of the 
TNF^Fl IB gene. A TNFRSFl IB isogene of the invention comprises guanine or adenine at PSl , 

35 guanine or thymine at PS2, cytosine or thymine at PS3, guanine or thymine at PS4, thymine or cytosine at 
PSS, cytosine or thymine at PS7, guanine or adenine at PSS, cytosine or adenine at PS9, cytosine or 
thymine at PSIO, thymine or cytosine at PSl 1, thymine or cytosine at PS12, cytosine or thymine at PS13, 
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guanine or adenine atPS14, cytosine or thymine at PS15, guanine or adenine in PS16, adenine or guanine 
at PS 17, adenine or guanine at PS 18, adenine or cytosine at PS 19, guanine or adenine at PS20, thymine or 
cytosine at PS21, guanine or thymine at PS22, guanine or adenine at PS23, adenine or thymine at PS24, 
and thymine or cytosine at PS25. The invention also provides a collection of TNFRSFl IB isogenes, 
5 referred to herein as a TNFRSF 1 1 B genome anthology. 

A TNFRSFl IB isogene may be defined by the combination and order of these polymorphisms in 
the isogene, which is referred to herein as a TNFRSF IIB haplotype. Thus, the invention also provides 
data on the number of different TNFRSFl IB haplotypes found in the reference populations used in the 
experiments described herein. This haplotype data is useful in methods for deriving a TNFRSFl IB 
10 haplotype from an individual's genotype for the TNFRSFl IB gene and for determining an association 
between a TNFRSFl IB haplotype and a particular trait. 

Li another embodiment, the invention provides a polynucleotide comprising a polymorphic 
variant of a reference sequence for a TNFRSFl IB cDNA or a fragment thereof. The reference sequence 
comprises SEQ ID N0:3 (Fig. 3) and the polymorphic cDNA comprises at least one polymorphism 
15 selected from the group consisting of thymine at a position corresponding to nucleotide 699, adenine at a 
position corresponding to nucleotide 714, guanine at a position corresponding to nucleotide 720, guanine 
at a position corresponding to nucleotide 768, adenine at a position corresponding to nucleotide 841, 
thymine at a position corresponding to nucleotide 1 1 02 and cytosine at a position corresponding to 
nucleotide 1 150. In a preferred embodiment, the polymorphic variant comprises an additional 
20 polymorphism of cytosine at a position corresponding to nucleotide 9 in Figure 3. 

Polynucleotides complementary to these TNFRSFl IB genomic and cDNA variants are also 
provided by the invention. 

In other embodiments, the invention provides a recombinant expression vector con^ristng one of 
the polymorphic genomic variants operably linked to expression regulatory elements as well as a 
25 recombinant host cell transformed or transfected with the expression vector. The recombinant vector and 
host cell may be used to express TNFRSFl IB for protein structure analysis and drug binding studies. 

In yet another embodiment, the invention provides a polypeptide comprising a polymorphic 
variant of a reference amino acid sequence for the TNFRSFl IB protein. The reference amino acid 
sequence comprises SEQ ID N0:4 (Fig. 4) and the polymorphic variant comprises at least one variant 
30 amino acid selected from the group consisting of methionine at a position corresponding to amino acid 
position 240, methionine at a position corresponding to amino acid position 281, and serine at a position 
corresponding to amino acid 368. In some embodiments, the polymorphic variant also comprises 
asparagine at a position corresponding to amino acid 3 in Figure 4. A polymorphic variant of 
TNFRSFl IB is useful in studying the effect of the variation on the biological activity of TNFRSFl IB as 
35 well as studying the binding affinity of candidate drugs targeting TNFRSFl IB for the treatment of 
osteoporosis and other disorders caused by abnormal osteoclast recruitment and function. 

The present invention also provides antibodies that recognize and bind to the above polymorphic 

5 
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TNFRSFl IB protein variant. Such antibodies can be utilized in a variety of diagnostic and prognostic 
formats and therapeutic methods. 

In other embodiments, the invention provides methods, compositions, and kits for haplotyping 
and/or genotyping the TNFRSFl IB gene in an individual. The methods involve identifying the 
S nucleotide or nucleotide pair present at one or more polymorphic sites selected from PS 1 -S,PS7-2S in one 
or bodi copies of the TNFRSFl IB gene from the individual. The compositions contain oligonucleotide 
probes and primers designed to specifically hybridize to one or more target regions containing, or that are 
adjacent to, a polymorphic site. The methods and compositions for establishing the genotype or 
haplotype of an individual at the novel polymorphic sites described herein are useful for studying the 

10 effect of the polymorphisms in the etiology of diseases affected by the expression and function of the 
TNFRSFl IB protein, studying the efficacy of drugs targeting TNFRSFl IB, predicting individual 
susceptibility to diseases affected by the expression and fiinction of the TNFRSFl IB protein and 
predicting individual responsiveness to drugs targeting TNFRSFl IB. 

In yet another embodiment, the invention provides a method for identifying an association 

IS between a genotype or haplotype and a trait. In preferred embodiments, the trait is susceptibility to a 
disease, severity of a disease, the staging of a disease or response to a drug. Such methods have 
applicability in developing diagnostic tests and therapeutic treatments for osteoporosis and other disorders 
caused by abnormal osteoclast recruitment and function. 

The present invention also provides transgenic animals comprising one of the TNFRSFl IB 

20 genomic polymorphic variants described herein and methods for producing such animals. The transgenic 
animals are useful for studying expression of the TNFRSFl IB isogenes in vivo, for in vivo screening and 
testing of drugs targeted against TNFRSFl IB protein, and for testing the efficacy of therapeutic agents 
and compounds for osteoporosis and other disorders caused by abnormal osteoclast recruitment and 
function in a biological system. 

25 The present invention also provides a computer system for storing and displaying polymorphism 

data determined for the TNFRSFl IB gene. The computer system comprises a computer processing unit; 
a display; and a database containing the polymorphism data. The polymorphism data includes the 
polymorphisms, the genotypes and the haplotypes identified for the TNFRSFl IB gene in a reference 
population. In a preferred embodiment, the conputer system is capable of producing a display showing 

30 TNFRSFl IB haplotypes organized according to their evolutionary relationships. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a partial reference sequence for the TNFRSFl IB gene (Genbank Version 
Number AB008821.1; contiguous lines; SEQ ID N0:1), with the underline indicating the start codon, 
35 shading indicating the reference coding sequence, and bold nucleotides indicating the polymorphic sites 
and polymorphisms identified by Applicants in a reference population. 

Figure 2 illustrates a partial reference sequence for the TNFRSFl 1 B gene (Genbank Version 
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Number AB008822.1; contiguous lines; SEQ ID N0:2), with the underline indicating the stop codon, 
shading indicating the reference coding sequence, and bold nucleotides indicating the polymorphic sites 
and polymorphisms identified by Applicants in a reference population. 

Figure 3 illustrates a reference sequence for the TNFRSFl IB coding sequence (GENBANK 
S ACC# AB002 146; contiguous lines; SEQ ID N0:3) with underlines indicating the start and stop codons, 
and bold nucleotides indicating the polymorphic sites and polymoipMstns identified by Applicants in a 
reference population. 

Figure 4 illustrates a reference sequence for the TNFRiSFl IB protein (GENBANK ACC # 
BAA2S910; contiguous lines; SEQ ID N0:4) with the bold amino acids indicating the amino acid 
10 variations caused by the polymorphisms of Figure 3. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is based on the discovery of novel variants of the TNFRSFl IB gene. As 
described in more detail below, the inventors herein discovered 24 novel polymorphic sites by 

1 5 characterizing the TNFRSF 1 1 B gene found in genomic DNAs isolated from Index Rqjositoiy lA that 
contains immortalized cell lines from one chimpanzee and 93 human individuals and Index Repository IB 
that contains 70 human individuals. These two repositories contain 5 1 individuals in common. 

The human individuals in Index Repository lA included a reference population of 79 unrelated 
individuals self-identified as belonging to one of four major population groups: Caucasian (22 

20 individuals), African descent (20 individuals) Asian (20 individuals) Hispanic/Latino (17 individuals). 
To the extent possible, the members of this reference population were organized into population 
subgroups by the self-identified ethnogeographic origin of their four grandparents as shown in Table 1 
below. In addition. Index Repository lA contains three unrelated indigenous American Indians (one from 
each of North, Central and South America), one three-generation Caucasian family (firom the CEPH Utah 

25 cohort) and one two-generation African-American family. 
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Table 1. Population Groups in Index Repository lA 



Population Group 


Population Subgroup 


No. of Individuals 


African descent 




20 




Sierra Leone 


1 


Asian 




20 




Burma 


1 




China 


3 




Japan 


6 




Korea 


1 




Philippines 


5 




Vietnam 


4 


Caucasian 




22 




British Isles 


3 




British Isles/Central 


4 




British Isles/Eastern 


1 




Central/Eastem 


I 




Eastern 


3 




Central/Mediterranean 


1 




Mediterranean 


2 




Scandinavian 


2 


Hispanic/Latino 




17 




Caribbean 


7 




Caribbean (Spanish Descent) 


2 




Central American (Spanish Descent) 


1 




Mexican American 


4 




South American (Spanish Descent) 


3 



Index Repository IB contains a reference population of 70 human individuals comprised of 4 
5 three-generation famiUes (from the CEPH Utah cohort) as well as unrelated African-American, Asian and 
Caucasian individuals. A total of 38 individuals in this reference population are unrelated. 

Using the TNFRSFl IB genotypes identified in the Index Repositories and the methodology 
described in the Examples below, the inventors herein also determined the haplotypes found on each 
chromosome for most human members of these repositories. The TNFRSFl IB genotypes and haplotypes 
10 found in the Index Repositories include those shown in Tables 4 and 5, respectively. The polymorphism 
and haplotype data disclosed herein are useful for studying population diversity, anthropological lineage, 
the significance of diversity and lineage at the phenotypic level, paternity testing, forensic applications, 
and for identifying associations between the TNFRSFl IB genetic variation and a trait such as level of 
drug response or susceptibility to disease. 
IS In the context of this disclosure, the following terms shall be defined as follows unless otherwise 

indicated: 

Allele - A particular form of a genetic locus, distinguished from other forms by its particular 
nucleotide sequence. 

Candidate Gene - A gene which is hypothesized to be responsible for a disease, condition, or the 
20 response to a treatment, or to be correlated with one of these. 

8 
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Gene - A segment of DNA that contains all the information for the regulated biosynthesis of an 
RNA product, including promoters, exons, introns, and other untranslated regions that control expression. 

Genotype - An unphased 5 ' to 3 ' sequence of nucleotide pair(s) found at one or more 
polymorphic sites in a locus on a pair of homologous chromosomes in an individual. As used herein, 
5 genotype includes a full-genotype and/or a sub-genotype as described below. 

Full-genotype - The unphased 5' to 3' sequence of nucleotide pairs found at all known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Sub-genotype - The unphased 5' to 3' sequence of nucleotides seen at a subset of the known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 
10 Genotyping - A process for determining a genotype of an individual. 

Haplotype - A 5' to 3' sequence of nucleotides found at one or more polymorphic sites in a locus 
on a single chromosome from a single individual. As used herein, haplotype includes a fitll-haplotype 
and/or a sub-haplotype as described below. 

FuU-haplotype - The S' to 3' sequence of nucleotides found at all known polymorphic sites in a 
IS locus on a single chromosome from a single individual. 

Snb-haplotype - The 5 ' to 3 ' sequence of nucleotides seen at a subset of the known polymorphic 
sites in a locus on a single chromosome from a single individual. 

Haplotype pair - The two haplotypes found for a locus in a single individual. 

Haplotyping - A process for determining one or more haplotypes in an individual and includes 
20 use of family pedigrees, molecular techniques and/or statistical inference. 

Haplotype data - Information concerning one or more of the following for a specific gene: a 
listing of the haplotype pairs in each individual in a population; a hsting of the different haplotypes in a 
population; frequency of each haplotype in that or other populations, and any known associations between 
one or more haplotypes and a trait. 
25 Isoform - A particular form of a gene, mRNA, cDNA or the protein encoded thereby, 

distinguished from other forms by its particular sequence and/or structure. 

Isogene - One of the isoforms of a gene found in a population. An isogene contains all of the 
polymorphisms present in the particular isoform of the gene. 

Isolated - As appUed to a biological molecule such as RNA, DNA, oligonucleotide, or protein, 
30 isolated means the molecule is substantially free of other biological molecules such as nucleic acids, 

proteins, lipids, carbohydrates, or other material such as cellular debris and growth media. Generally, the 
term "isolated" is not intended to refer to a complete absence of such material or to absence of water, 
buffers, or salts, unless they are present in amounts that substantially interfere with the methods of the 
present invention. 

35 Locus - A location on a chromosome or DNA molecule corresponding to a gene or a physical or 

phenotypic feature. 

Naturally-occurring - A term used to designate that the object it is applied to, e.g., naturally- 
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occurring polynucleotide or polypeptide, can be isolated from a source in nature and which has not been 
intentionally modified by man. 

Nucleotide pair - The nucleotides found at a polymorphic site on the two copies of a 
chromosome from an individual. 
5 Phased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 

locus, phased means the combination of nucleotides present at those polymorphic sites on a single copy of 
the locus is known. 

Polymorphic site (PS) - A position within a locus at which at least two alternative sequences are 
found in a population, the most frequent of which has a frequency of no more than 99%. 
10 Polymorphic variant - A gene, mRNA cDNA, polypeptide or peptide whose nucleotide or 

amino acid sequence varies from a reference sequence due to the presence of a polymorphism in the gene. 

Polymorphism - The sequence variation observed in an individual at a polymorphic site. 
Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and may, but 
need not, result in detectable differences in gene expression or protein function. 
1 3 ' Polymorphism data - Information concerning one or more of the following for a specific gene: 
location of polymorphic sites; sequence variation at those sites; frequency of polymorphisms in one or 
more populations; the different genotypes and/or haplotypes determined for the gene; frequency of one or 
more of these genotypes and/or haplotypes in one or more populations; any known association(s) between 
a trait and a genotype or a haplotype for the gene. 
20 Polymorphism Database - A collection of polymorphism data arranged in a systematic or 

methodical way and capable of being individually accessed by electronic or other means. 

Polynucleotide - A nucleic acid molecule comprised of single-stranded RNA or DNA or 
comprised of con^lementary, double-stranded DNA. 

Population Group - A group of individuals sharing a conmion ethnogeographic origin. 
25 Reference Population - A group of subjects or individuals who are predicted to be representative 

of the genetic variation found in the general population. Typically, the reference population represents 
the genetic variation in the population at a certainty level of at least 85%, preferably at least 90%, more 
preferably at least 95% and even more preferably at least 99%. 

Single Nucleotide Polymorphism (SNP) - Typically, the specific pair of nucleotides observed at 
30 a single polymorphic site. In rare cases, three or four nucleotides may be foimd. 

Subject - A human individual whose genotypes or haplotypes or response to treatment or disease 
state are to be determined. 

Treatment - A stimulus administered internally or externally to a subject. 
Vnphased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 
35 locus, imphased means the combination of nucleotides present at those polymorphic sites on a single copy 
of the locus is not known. 

The inventors herein have discovered 24 novel polymorphic sites, and confirmed the existence of 
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a 25* site, in the T7»IFRSF1 IB gene. The polymorphic sites identified by the inventors are referred to as 
PS 1-25 to designate the order in which they are located in the gene (see Table 3 below), with the novel 
polymorphic sites referred to as PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9. PSIO, PSll, PS12, PS13, 
PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS 25, and the previously 
5 reported polymorphic site referred to as PS6. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 
polymorphic variant of the TNFRSFl IB gene or a fragment of the gene which contains at least one of the 
novel polymorphic sites described herein. The nucleotide sequence of a variant TNFRSFl IB gene is 
identical to the reference genomic sequence for those portions of the gene examined, as described in the 

10 Examples below, except that it comprises a different nucleotide at one or more of the novel polymorphic 
sites selected from the group consisting of PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PSIO, PSl 1, PS 12, 
PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS 25, and may also 
comprise an additional polymorphism of cytosine at PS6. Similarly, the nucleotide sequence of a variant 
fragment of the TNFRSFl IB gene is identical to the corresponding portion of the reference sequence 

1 5 except for having a different nucleotide at one or more of the novel polymorphic sites described herein. 
Thus, the invention specifically does not include polynucleotides comprising a nucleotide sequence 
identical to the reference sequence (or other reported TNFRSFl IB sequences) or to portions of the 
reference sequence (or other reported TNFRSFl IB sequences), except for genotyping oligonucleotides as 
described below. 

20 The location of a polymorphism in a variant gene or fragment is identified by aligning its 

sequence against SEQ ID NOS:l-2. The polymorphism is selected from the group consisting of adenine 
at PSl, thymine at PS2, thymine at PS3, thymine at PS4, cytosine at PS5, thymine at PS7, adenine at PS8, 
adenine at PS9, thymine at PSIO, cytosine at PSl 1, cytosine at PS12, thymine at PS13, adenine at PS14, 
thymine at PS15, adenine at PS16, guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, 

25 cytosine at PS21, thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS25. In a 
preferred embodiment, the polymorphic variant comprises a naturally-occurring isogene of the 
TNFRSFl IB gene which is defmed by any one of haplotypes 1-27 shown in Table 5 below. 

Polymorphic variants of the invention may be prepared by isolating a clone containing the 
TNFRSFl IB gene from a human genomic library. The clone may be sequenced to determine the identity 

30 of the nucleotides at the polymorphic sites described herein. Any particular variant claimed herein could 
be prepared from this clone by performing in vitro mutagenesis using procedures well-known in the art. 

TNFRSFl IB isogenes may be isolated using any method that allows separation of the two 
"copies" of the TNFRSFl IB gene present in an individual, which, as readily understood by the skilled 
artisan, may be the same allele or different alleles. Separation methods include targeted in vivo cloning 

35 (TIVC) in yeast as described in WO 98/01573, U.S. Patent No. 5,866,404, and copending U.S. 

application Serial No. 08/987,966. Another method, which is described in copending U.S. Application 
Serial No. 08/987,966, uses an allele specific oligonucleotide in combination with primer extension and 
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exonuclease degradation to generate hemizygous DNA targets. Yet other methods are single molecule 
dilution (SMD) as described in Ruafio et al., Proc. Natl. Acad. Sci. 87:6296-6300, 1990; and allele 
specific PCR (Ruafio et al., 17 Nucleic Acids. Res. 8392, 1989; Ruafio et al., 19 Nucleic Acids Res. 6877- 
6882, 1991; Michalatos-Beloin et al., 24 Nucleic Acids Res. 484 M843, 1996). 
S The invention also provides TNFRSFl IB genome anthologies, which are collections of 

TNFRSFl IB isogenes found in a given population. The population may be any group of at least two 
individuals, including but not limited to a reference population, a population group, a family population, a 
clinical population, and a same sex population. A TNFRSFl IB genome anthology may comprise 
individual TNFRSFl IB isogenes stored in separate containers such as microtest tubes, separate wells of a 
10 microtitre plate and the like. Alternatively, two or more groups of the TNFRSFl IB isogenes in the 
anthology may be stored in separate containers. Individual isogenes or groups of isogenes in a genome 
anthology may be stored in any convenient and stable form, including but not limited to in buffered 
solutions, as DNA precipitates, fireeze-dried preparations and the like. A preferred TNFRSFl IB genome 
anthology of the invention comprises a set of isogenes defined by the haplotypes shown in Table S below. 

15 An isolated polynucleotide containing a polymorphic variant nucleotide sequence of the invention 

may be operably linked to one or more expression regulatory elements in a recombinant expression vector 
capable of being propagated and expressing the encoded TNFRSFl IB protein in a prokaryotic or a 
eukaryotic host cell. Examples of expression regulatory elements which may be used include, but are not 
limited to, the lac system, operator and promoter regions of phage lambda, yeast promoters, and 

20 promoters derived firom vaccinia virus, adenovirus, retroviruses, or SV40. Other regulatory elements 

include, but are not limited to, appropriate leader sequences, termination codons, polyadenylation signals, 
and other sequences required for the appropriate transcription and subsequent translation of the nucleic 
acid sequence in a given host cell. Of course, the correct combinations of expression regulatory elements 
will depend on the host system used. In addition, it is understood that die expression vector con^ins any 

25 additional elements necessary for its transfer to and subsequent replication in the host cell. Examples of 
such elements include, but are not limited to, origins of replication and selectable markers. Such 
expression vectors are commercially available or are readily constructed using methods known to those in 
the art (e.g., F. Ausubel et al., 1 987, in "Current Protocols in Molecular Biology", John Wiley and Sons, 
New York, New York). Host cells which may be used to express the variant TNFRSFl IB sequences of 

30 file invention include, but are not limited to, eukaryotic and mammalian cells, such as animal, plant, insect 
and yeast cells, and prokaryotic cells, such as E. coU, or algal cells as known in the art. The recombinant 
expression vector may be introduced into the host cell using any method known to those in the art 
including, but not limited to, microinjection, electroporation, particle bombardment, transduction, and 
transfection using DEAE-dextran, lipofection, or calcium phosphate (see e.g., Sambrook et al. (1989) in 

35 "Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). In a 
preferred aspect, eukaryotic expression vectors that function in eukaryotic cells, and preferably 
mammalian cells, are used. Non-limiting examples of such vectors include vaccinia virus vectors. 
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adenovirus vectors, herpes virus vectors, and baculovirus transfer vectors. Preferred eukaryotic cell lines 
include COS cells, CHO cells, HeLa cells, NIH/3T3 cells, and embryonic stem cells (Thomson, J. A. et 
al., 1 998 Science 282: 1 1 45-1 1 47), Particularly preferred host cells are mammalian cells. 

As will be readily recognized by the skilled artisan, expression of polymorphic variants of the 
5 TNFRSFl IB gene will produce TNFRSFl IB mRNAs varying from each other at any polymorphic site 
retained in the spliced and processed mRNA molecules. These mRNAs can be used for the preparation of 
a TNFRSFl IB cDNA comprising a nucleotide sequence which is a polymorphic variant of the 
TNFRSFl IB reference coding sequence shown in Figure 3. Thus, the invention also provides 
TNFRSFl IB mRNAs and corresponding cDNAs which comprise a nucleotide sequence that is identical 

10 to SEQ ID N0:3 (Fig. 3), or its corresponding RNA sequence, except for having one or more 

polymorphisms selected from the group consisting of thymine at a position corresponding to nucleotide 
699, adenine at a position corresponding to nucleotide 714, guanine at a position corresponding to 
nucleotide 720, guanine at a position coiresponding to nucleotide 768, adenine at a position 
corresponding to nucleotide 841, thymine at a position corresponding to nucleotide 1 102 and cytosine at a 

15 position corresponding to nucleotide 1 150, and may also comprise an additional polymorphism of 

cytosine at a position corresponding to nucleotide 9 in Figure 3. Fragments of these variant mRNAs and 
cDNAs are included in the scope of the invention, provided they contain the novel polymorphisms 
described herein. The invention specifically excludes polynucleotides identical to previously identified 
and characterized TNFRSFl IB cDNAs and fragments thereof. Polynucleotides comprising a variant 

20 RNA or DNA sequence may be isolated from a biological sample using well-known molecular biological 
procedures or may be chemically synthesized. 

Genomic and cDNA fragments of the invention comprise at least one novel polymorphic site 
identified herein and have a length of at least 10 nucleotides and may range up to the full length of the 
gene. Preferably, a fragment according to the present invention is between 100 and 3000 nucleotides in 

25 length, and more preferably between 200 and 2000 nucleotides in length, and most preferably between 
500 and 1000 nucleotides in length. 

In describing the polymorphic sites identified herein, reference is made to the sense strand of the 
gene for convenience. However, as recognized by the skilled artisan, nucleic acid molecules containing 
the TNFRSFl IB gene may be complementary double stranded molecules and thus reference to a 

30 particular site on the sense strand refers as well to the corresponding site on the complementary antisense 
strand. Thus, reference may be made to the same polymorphic site on either strand and an 
oligonucleotide may be designed to hybridize specifically to either strand at a target region containing the 
polymorphic site. Thus, the invention also includes single-stranded polynucleotides which are 
complementary to the sense strand of the TNFRSFl IB genomic variants described herein. 

35 Polynucleotides comprising a polymorphic gene variant or fragment may be usefiil for 

therapeutic purposes. For example, where a patient could benefit from expression, or increased 
expression, of a particular TNFRSFl IB protein isoform, an expression vector encoding the isoform may 
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be administered to the patient. The patient may be one who lacks the TNFRSFl IB isogene encoding that 
isoform or may ah-eady have at least one copy of that isogene. 

In other situations, it may be desirable to decrease or block expression of a particular 
TNFRSFl IB isogene. Expression of a TNFRSFl IB isogene may be turned off by transforming a 
S targeted organ, tisstie or cell population with an expression vector that expresses high levels of 

untranslatable mRNA for the isogene. Alternatively, oligonucleotides directed against the regulatory 
regions (e.g., promoter, introns, enhancers, 3 ' untranslated region) of the isogene may block transcription. 
Oligonucleotides targeting the transcription initiation site, e.g., between positions -10 and +10 from the 
start site are preferred. Similarly, inhibition of transcription can be achieved using oligonucleotides that 

10 base-pair with region(s) of the isogene DNA to form triplex DNA (see e.g., Gee et al. in Huber, B.E. and 
B.I. Cair, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, N.Y., 1994). 
Antisense oligonucleotides may also be designed to block translation of TNFRSFl IB mRNA transcribed 
from a particular isogene. It is also contemplated that ribozymes may be designed that can catalyze the 
specific cleavage of TNFRSFl IB mRNA transcribed from a particular isogene. 

1 5 The oligonucleotides may be delivered to a target cell or tissue by expression fi-om a vector 

introduced into the cell or tissue in vivo or ex vivo. Alternatively, the oligonucleotides may be formulated 
as a pharmaceutical composition for administration to the patient. Oligoribonucleotides and/or 
oligodeoxynucleotides intended for use as antisense oligonucleotides may be modified to increase 
stability and half-life. Possible modifications include, but are not limited to phosphorothioate or 2' O- 

20 methyl linkages, and the inclusion of nontraditional bases such as inosine and queosine, as well as acetyl-, 
methyl-, thio-, and similarly modified forms of adenine, cytosine, guanine, thymine, and uracil which are 
not as easily recognized by endogenous nucleases. 

The invention also provides an isolated polypeptide comprising a polymorphic variant of the 
reference TNFRSFl IB amino acid sequence shown in Figure 4 (SEQ ID N0:4). The location of a variant 

25 amino acid in a TNFRSF 1 IB polypeptide or fragment of the invention is identified by aligning its 
sequence against Fig. 4. A TNFRSFl IB protein variant of ttie invention comprises an amino acid 
sequence identical to SEQ ID NO: 4 except for having one or more variant amino acids selected from the 
group consisting of methionine at a position corresponding to amino acid position 240, methionine at a 
position corresponding to amino acid position 281, and serine at a position corresponding to amino acid 

30 368, and may also comprise an additional variant amino acid of asparagine at a position corresponding to 
amino acid 3 in Figure 4. The invention specifically excludes amino acid sequences identical to those 
previously identified for TNFRSFl IB, including SEQ ID NO: 4, and previously described fragments 
thereof. TNFRSFl IB protein variants included within the uivention comprise all amino acid sequences 
based on SEQ ID NO: 4 and having the combination of amino acid variations described in Table 2 below. 

35 In preferred embodiments, a TNFRSFl IB protein variant of the invention is encoded by an isogene 
defined by one of the observed haplotypes shown in Table 5. 
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Table 2. Novel Protein Variants of TNFRSFllB 


Isoform No. 


Amino Acid Positions 


3 


240 


281 


3oo 


1 


XT 

N 


M 


X It 

M 


c 


2 


XT 


M 


M 


T 


■J 
J 


XT 


A It 

M 


V 


o 


4 


XT 


M 


V 


1 


3 


XI 


T 
I 


iVl 


O 


0 


N 


I 


M 


T 


7 


N 


I 


V 


S 


8 


K 


M 


M 


s 


9 


K 


M 


M 


T 


10 


K 


M 


V 


S 


11 


K 


M 


V 


T 


12 


K 


I 


M 


S 


13 


K 


I 


M 


T 


14 


K 


I 


V 


S 



The invention also includes TNFRSFl IB peptide variants, which are any fragments of a 
5 TTNFRSFl IB protein variant that contains one or more of the novel amino acid variations shown in Table 
2. A TNFRSFl IB peptide variant is at least 6 amino acids in length -and is preferably any number 
between 6 and 30 amino acids long, more preferably between 10 and 25, and most preferably between IS 
and 20 amino acids long. Such TNFRSFl IB peptide variants may be useful as antigens to generate 
antibodies specific for one of the above TNFRSFl IB isoforms. In addition, the TNFRSFl IB peptide 

10 variants may be useful in drug screening assays. 

A TNFRSFl IB variant protein or peptide of the invention may be prepared by chemical synthesis 
or by expressing one of the variant TNFRSFl IB genomic and cDNA sequences as described above. 
Alternatively, the TNFRSFl IB protein variant may be isolated from a biological sample of an individual 
having a TNFRSFl IB isogene which encodes the variant protein. Where the sample contains two 

IS different TNFRSFl IB isoforms (i.e., the individual has different TNFRSFl IB isogenes), a particular 
TNFRSFl IB isoform of the invention can be isolated by immunoaffmity chromatography using an 
antibody which specifically binds to that particular TNFRSFl IB isoform but does not bind to the other 
TNFRSFl IB isoform. 

The expressed or isolated TNFRSFl IB protein may be detected by methods known in the art, 
20 including Coomassie blue staining, silver staining, and Western blot analysis using antibodies specific for 
the isoform of the TNFRSFl IB protein as discussed further below. TNFRSFl IB variant proteins can be 
purified by standard protein purification procedures known in the art, including differential precipitation, 
molecular sieve chromatography, ion-exchange chromatography, isoelectric focusing, gel electrophoresis, 
affinity and immunoaflinity chromatography and the like. (Ausubel et. al., 1987, In Current Protocols in 
25 Molecular Biology John Wiley and Sons, New York, New York). In the case of immunoaffinity 
chromatography, antibodies specific for a particular polymorphic variant may be used. 

15 



wo 01/04137 PCT/USOO/18803 
A polymorphic variant TNFRSFl IB gene of the invention may also be fused in frame with a 
heterologous sequence to encode a chimeric TNFRSFl IB protein. The non-TNFRSFl IB portion of the 
chimeric protein may be recognized by a commercially available antibody. In addition, the chimeric 
protein may also be engineered to contain a cleavage site located between the TNFRSFl IB and non- 
5 TNFRSFl IB portions so that the TNFRSF 1 1 B protein may be cleaved and purified away from the non- 
TNFRSFl IB portion. 

An additional embodiment of the invention relates to using a novel TNFRSFl IB protein isoform 
in any of a variety of drug screening assays. Such screening assays may be performed to identify agents 
that bind specifically to all known TNFRSFl IB protein isoforms or to only a subset of one or more of 

10 these isoforms. The agents may be from chemical compound libraries, peptide libraries and the like. The 
TNFRSFl IB protein or peptide variant may be free in solution or affixed to a solid support. In one 
embodiment, high throughput screening of compounds for binding to a TNFRSFl IB variant may be 
acconq)lished using the method described in PCT application WO84/03S6S, in which large numbers of 
test compounds are synthesized on a solid substrate, such as plastic pins or some other surface, contacted 

15 with the TNFRSFl IB protein(s) of interest and then washed. Bound TNFRSFl IB protein(s) are then 
detected using methods welMcnown in the art. 

In ano&er embodiment, a novel TNFRSFl IB protein isoform may be used in assays to measure 
the binding affinities of one or more candidate drugs targeting the TNFRSFl IB protein. 

In another embodiment, the invention provides antibodies specific for and iramunoreactive with 

20 one or more of the novel TNFRSFl 1 B variant proteins described herein. The antibodies may be either 
monoclonal or polyclonal in origin. The TNFRSFl IB protein or peptide variant used to generate the 
antibodies may be from natural or recombinant sources or produced by chemical synthesis using synthesis 
techniques known in the art. If the TNFRSFl IB protein variant is of insufficient size to be antigenic, it 
may be conjugated, coniplexed, or otherwise covalently linked to a carrier molecule to enhance the 

25 antigenicity of the peptide. Examples of carrier molecules, include, but are not limited to, albumins (e.g., 
human, bovine, fish, ovine), and keyhole limpet hemocyanin (Basic and Clinical Immunology, 1991, Eds. 
D.P. Stites, and A.I. Terr, Appleton and Lange, Norwalk Connecticut, San Mateo, California). 

In one embodiment, an antibody specifically immunoreactive with one of the novel TNFRSFl IB 
protein isoforms described herein is administered to an individual to neutralize activity of the 

30 TNFRSF 1 1 B isoform expressed by that individual. The antibody may be formulated as a pharmaceutical 
con^osition which includes a pharmaceutically acceptable carrier. 

Antibodies specific for and immunoreactive with one of the novel TNFRSFl IB protein isoform 
described herein may be used to immiuioprecipitate the TNFRSFl IB protein variant from solution as 
well as react with TNFRSFl IB protein isoforms on Western or immunoblots of polyacrylamide gels on 

35 membrane supports or substrates. In another preferred embodiment, the antibodies will detect 

TNFRSFl IB protein isoforms in paraffin or frozen tissue sections, or in cells which have been fixed or 
unfixed and prepared on slides, coverslips, or the like, for use in inmiimocytochemical, 



16 



wo 01/04137 PCT/USOO/18803 
immunohistochemical, and immunofluorescence techniques. 

In another embodiment, an antibody specifically immunoreactive witih one of the novel 
TNFRSFl IB protein variants described herein is used in immunoassays to detect this variant in biological 
sanqiles. In diis method, an antibody of the present invention is contacted with a biological sample and 
S the formation of a complex between the TNFRSFl IB protein variant and the antibody is detected. As 
described, suitable immunoassays include radioimmunoassay, Western blot assay, immunofluorescent 
assay, enzyme linked immunoassay (ELISA), chemiluminescent assay, immunohistochemical assay, 
immunocytochemical assay, and the like (see, e.g.. Principles and Practice of Immunoassay, 1991, Eds. 
Christopher P. Price and David J. Neoman, Stockton Press, New York, New York; Current Protocols in 

10 Molecular Biology, 1987, Eds. Ausubel et al., John Wiley and Sons, New York, New York). Standard 
techniques known in the art for ELISA are described in Methods in Immunodiagnosis, 2nd Ed., Eds. Rose 
and Bigazzi, John Wiley and Sons, New York 1980; and Campbell et al., 1984, Methods in Immunology, 
W.A. Benjamin, Lie). Such assays may be direct, indirect, competitive, or noncompetitive as described 
in the art (see, e.g.. Principles and Practice of Immunoassay, 1991, Eds. Christopher P. Price and David J. 

15 Neoman, Stockton Prcs, NY, NY; and Oellirich, M., 1 984, J. Clin. Chem. Clin. Biochem., 22:895-904). 
Proteins may be isolated from test specimens and biological samples by conventional methods, as 
described in Current Protocols in Molecular Biology, supra. 

Exemplary antibody molecules for use in the detection and therapy methods of the present 
invention are intact immunoglobulin molecules, substantially intact immunoglobulin molecules, or those 

20 portions of immunoglobulin molecules that contain the antigen binding site. Polyclonal or monoclonal 
antibodies may be produced by methods conventionally known in the art (e.g., Kohler and Milstein, 1975, 
Nature, 256:495-497; Campbell Monoclonal Antibody Technology, the Production and Characterization 
of Rodent and Human Hybridomas, 1985, In: Laboratory Techniques in Biochemistry and Molecular 
Biology, Eds. Burdon et al.. Volume 13, Elsevier Science Publishers, Amsterdam). The antibodies or 

25 antigen binding fragments thereof may also be produced by genetic engineering. The technology for 
expression of both heavy and light chain genes in E. coli is the subject of PCT patent applications, 
publication number WO 901443, WO 901443 and WO 9014424 and in Huse et al., 1989. Science, 
246:1275-1281, The antibodies may also be humanized (e.g.. Queen, C. et al. 1989 Proc. Natl. Acad. Sci. 
86; 10029). 

30 Effect(s) of the polymorphisms identified herein on expression of TNFRSFl IB may be 

investigated by prq)aring recombinant cells and/or organisms, preferably recombinant animals, 
containing a polymorphic variant of the TNFRSFIIB gene. As used herein, "expression" includes but is 
not limited to one or more of the following: transcription of the gene .into precursor mRNA; splicing and 
oflier processing of the precursor mRNA to produce mature mRNA; mRNA stability; translation of the 

35 mature mRNA into TNFRSFl IB protein (including codon usage and tRNA availability); and 

glycosylation and/or other modifications of the translation product, if required for proper expression and 
function. 
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To prepare a recombinant cell of the invention, the desired TNFRSFl IB isogene may be 
introduced into the cell in a vector such that Uie isogene remains extrachromosomal. hi such a situation, 
the gene will be expressed by the cell firom the extrachromosomal location. In a preferred embodiment, 
thp TOFRSFl IB isogene is introduced into a cell in such a way that it recombines with the endogenous 
S TNFRSFl IB gene present in the cell. Such recombination requires the occurrence of a double 

recombination event, thereby resulting in the desired TNFRSFl IB gene polymorphism. Vectors for the 
introduction of genes both for recombination and for extrachromosomal maintenance are known in the 
art, and any suitable vector or vector construct may be used in the invention. Methods such as 
electroporation, particle bombardment, calcium phosphate co-precipitation and viral transduction for 

1 0 introducing DNA into cells are known in the art; therefore, the choice of method may lie with the 
competence and preference of the skilled practitioner. Examples of cells into which the TNFRSFl IB 
isogene may be introduced include, but are not limited to, continuous culture cells, such as COS, 
NIH/3T3, and primary or culture cells of the relevant tissue type, i.e., they express the TNFRSF 1 IB 
isogene. Such recombinant cells can be used to compare the biological activities of the different protein 

IS variants. 

Recombinant organisms, i.e., transgenic animals, expressing a variant TNFRSFl IB gene are 
prepared using standard procedures known in the art. Preferably, a construct comprising the variant gene 
is introduced into a nonhuman animal or an ancestor of the animal at an embryonic stage, i.e., the one-cell 
stage, or generally not later than about the eight-cell stage. Transgenic animals carrying the constructs of 

20 the invention can be made by several methods known to those having skill in the art. One method 

involves transfecting into the embryo a retrovirus constructed to contain one or more insulator elements, a 
gene or genes of interest, and other components known to those skilled in the art to provide a complete 
shuttle vector harboring the insulated gene(s) as a transgene, see e.g., U.S. Patent No. 5,610,053. Another 
method involves directly injecting a transgene into the embryo. A third method involves the use of 

25 embryonic stem cells. Examples of animals into which the TNFRSFl IB isogenes may be introduced 

include, but are not limited to, mice, rats, other rodents, and nonhuman primates (see "The Introduction of 
Foreign Genes into Mice" and the cited references therein. In: Recombinant DNA, Eds. J.D. Watson, M. 
Oilman, J. Witkowski, and M. ZoIIer; W.H. Freeman and Company, New York, pages 254-272). 
Transgenic animals stably expressing a human TNFRSFl IB isogene and producing human TNFRSFl IB 

30 protein can be used as biological models for studying diseases related to abnormal TNFRSFl IB 

expression and/or activity, and for screening and assaying various candidate drugs, compounds, and 
treatment regimens to reduce the synq)toms or effects of these diseases. 

An additional embodiment of the invention relates to pharmaceutical compositions for treating 
disorders affected by expression or function of a novel TNFRSFl 1 B isogene described herein. The 

35 pharmaceutical composition may comprise any of the following active ingredients: a polynucleotide 

comprising one of these novel TNFRSFl IB isogenes; an antisense oligonucleotide directed against one of 
the novel TNFRSFl IB isogenes, a polynucleotide encoding such an antisense oligonucleotide, or another 
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compound which inhibits expression of a novel TNFRSFl IB isogene described herein. Preferably, the 
composition contains the active ingredient in a therapeutically effective amount. By therapeutically 
effective amount is meant that one or more of the symptoms relating to disorders affected by expression 
or function of a novel TNFRSFl IB isogene is reduced and/or eliminated. The composition also 
S comprises a pharmaceutically acceptable carrier, examples of which include, but are not limited to, saline, 
buffered saline, dextrose, and water. Those skilled in the art may employ a formulation most suitable for 
the active ingredient, whether it is a polynucleotide, oligonucleotide, protein, peptide or small molecule 
antagonist. The pharmaceutical composition may be administered alone or in combination with at least 
one other agent, such as a stabilizing compound. Administration of the pharmaceutical composition may 

10 be by any number of routes including, but not limited to oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, intradermal, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal. Further details on techniques for formulation and 
administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, PA). 

IS For any composition, determination of the therapeutically effective dose of active ingredient 

and/or the appropriate route of administration is well within the capability of those skilled in the art. For 
example, the dose can be estimated initially either in cell culture assays or in animal models. The animal 
model may also be used to determine the appropriate concentration range and route of administration. 
Such information can then be used to determine useful doses and routes for administration in humans. 

20 The exact dosage will be determined by the practitioner, in light of factors relating to the patient requiring 
treatment, including but not limited to severity of the disease state, general health, age, weight and gender 
of the patient, diet, time and frequency of administration, other drugs being taken by the patient, and 
tolerance/response to the treatment. 

Information on the identity of genotypes and haplotypes for the TNFRSFl IB gene of any 

25 particular individual as well as the frequency of such genotypes and haplotypes in any particular 

population of individuals is expected to be useful for a variety of basic research and clinical applications. 
Thus, the invention also provides compositions and methods for detecting the novel TNFRSFl IB 
polymorphisms identified herein. 

The compositions comprise at least one TNFRSFl IB genotyping oligonucleotide. In one 

30 embodiment, a TNFRSF 1 IB genotyping oligonucleotide is a probe or primer capable of hybridizing to a 
target region that is located close to, or that contains, one of the novel polymorphic sites described herein. 
As used herein, the term "oligonucleotide" refers to a polynucleotide molecule having less than about 100 
nucleotides. A preferred oligonucleotide of the invention is 10 to 35 nucleotides long. More preferably, 
the oligonucleotide is between 15 and 30, and most preferably, between 20 and 25 nucleotides in length. 

35 The oligonucleotide may be comprised of any phosphorylation state of ribonucleotides, 

deoxyribonucleotides, and acycHc nucleotide derivatives, and other functionally equivalent derivatives. 
Alternatively, oligonucleotides may have a phosphate-free backbone, which may be comprised of 
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linkages such as carboxymethyl, acetamidate, carbamate, polyamide (peptide nucleic acid (PNA)) and the 
like (Varma, R. in Molecular Biology and Biotechnology, A Comprehensive Desk Reference, Ed. R. 
Meyers, VCH Publishers, Inc. (1995), pages 617-620). OUgonucleotides of the invention may be 
prepared by chemical synthesis using any suitable methodology known in the art, or may be derived from 
S a biological sample, for example, by restriction digestion. The oligonucleotides may be labeled, 

according to any technique known in the art, including use of radiolabels, fluorescent labels, enzymatic 
labels, proteins, haptens, antibodies, sequence tags and the like. 

Genotyping oligonucleotides of the invention must be capable of specifically hybridizing to a 
target region of a TNFRSFl IB polynucleotide, i.e., a TNFRSFl IB isogene. As used herein, specific 

10 hybridization means the oligonucleotide forms an anti-parallel double-stranded structure with the target 
region under certain hybridizing conditions, while failing to forni such a structure when incubated with a 
non-target region or a non-TNFRSFl IB polynucleotide under the same hybridizing conditions. 
Preferably, the oligonucleotide specifically hybridizes to the target region under conventional high 
stringency conditions. The skilled artisan can readily design and test oligonucleotide probes and primers 

15 suitable for detecting polymorphisms in the T^fFRSFl IB gene using the polymorphism information 
provided herein in conjunction with the known sequence information for the TNFRSFl IB gene and 
routine techniques. 

A nucleic acid molecule such as an oligonucleotide or polynucleotide is said to be a "perfect" or 
"complete" complement of another nucleic acid molecule if every nucleotide of one of the molecules is 

20 complementary to the nucleotide at the corresponding position of the other molecule. A nucleic acid 
molecule is "substantially complementary" to another molecule if it hybridizes to that molecule with 
sufHcient stability to remain in a duplex form under conventional low-stringency conditions. 
Conventional hybridization conditions are described, for example, by Sambrook J. et al., in Molecular 
Cloning, A Laboratory Manual, 2"* Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1 989) 

25 and by Haymes, B.D. et al, in Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, 
D.C. (1985). While perfectly complementary oligonucleotides are preferred for detecting 
polymorphisms, departures from complete complementarity are contemplated where such departures do 
not prevent the molecule from specifically hybridizing to the target region. For example, an 
oligonucleotide primer may have a non-complementary fragment at its 5' end, with the remainder of the 

30 primer being complementary to the target region. Alternatively, non-contplementary nucleotides may be 
interspersed into the oligonucleotide probe or primer as long as the resulting probe or primer is still 
capable of specifically hybridizing to die target region. 

Preferred genotyping oligonucleotides of the invention are allele-specific oligonucleotides. As 
used herein, the term allele-specific oligonucleotide (ASO) means an oligonucleotide that is able, under 

35 sufficiently stringent conditions, to hybridize specifically to one allele of a gene, or other locus, at a target 
region containing a polymorphic site while not hybridizing to the corresponding region in another 
allele(s). As understood by the skilled artisan, allele-specificity will depend upon a variety of readily 
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optimized stringency conditions, including salt and fonnamide concentrations, as well as temperatures for 
both the hybridization and washing steps. Examples of hybridization and washing conditions typically 
used for ASO probes are found in Kogan et al., "Genetic Prediction of Hemophilia A" in PCR Protocols, 
A Guide to Methods and Applications, Academic Press, 1990 and Ruano et al., 87 Proc. Natl. Acad. Sci. 
5 USA 6296-6300, 1 990. Typically, an allele-specific oligonucleotide will be perfectly complementary to 
one allele while containing a single mismatch for another allele. 

Allele-specific oligonucleotide probes which usually provide good discrimination between 
different alleles are those in which a central position of the oligonucleotide probe aligns with the 
polymorphic site in the target region (e.g., approximately the 7* or 8* position in a 15 raer, the 8* or 9* 
10 position in a 16mer, the 10* or 1 1* position in a 20 mer). A prefenwd ASO probe for detecting 

TNFRSF 1 IB gene polymorphisms comprises a nucleotide sequence, listed 5 ' to 3 ', selected from the 
group consisting of: 
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15 


ACTTGAAGATGAATG 
ACTTGAAAATGAATG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


5) and its complement, 
6) and its complement. 




GATCTTGGCTGGATC 
GATCTTGTCTGGATC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


7) 
8) 


and its complement, 
and its con^lement. 


20 


CCACCGCCCCACCCC 
CCACCGCTCCACCCC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


9) 
10) 


and its complement, 
and its complement. 


25 


TCCCTGGGGGATCCT 
TCCCTGGTGGATCCT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


11) 
12) 


and 
and 


its 
its 


complement , 
complement, 


GCGTTAATCCTGGAG 
GCGTTAACCCTGGAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


13) 
14) 


and 
and 


its 
its 


complement, 
complement , 


30 


CCTGGGCCAGCCGAC 
CCTGGGCTAGCCGAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


15) 
16) 


and 
and 


its 
its 


complement , 
complement. 




GGGAGAAGGCTCCAC 
GGGAGAAAGCTCCAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


17) 
18) 


and 
and 


its 
its 


complement, 
complement. 


35 


CCTTTTACGCTGCAA 
CCTTTTAAGCTGCAA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


19) 
20) 


and 
and 


its 
its 


complement, 
complement. 


40 


Accession No.:AB008822 
GCTGGTACGTGTCAA (SEQ ID 
GCTGGTATGTGTCAA (SEQ ID 


NO: 
NO: 


21) 
22) 


and 
and 


its 
its 


complement, 
complement. 




AGGACCATTGCTCAG 
AGGACCACTGCTCAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


23) 
24) 


and 
and 


its 
its 


complement , 
complement , 


45 


AACATAATAGTAGCA 
AACATAACAGTAGCA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


25) 
26] 


and 
and 


its 
its 


complement , 
complement. 




TATTTTCCGTAGGAA 
TATTTTCTGTAGGAA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


27) 
28) 


and 
and 


its 
its 


complement, 
complement. 
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10 



15 



20 



25 



30 



35 



40 



45 



SO 



CATTTTAGCATATTT 
CATTTTAACATATTT 



(SEQ ID NO: 29) and its complement, 
(SEQ ID NO: 30) and its complement, 



AAGTAAACGCAGAGA (SEQ ID NO: 31) and its complement, 
AAGTAAATGCAGAGA (SEQ ID NO: 32) and its complement. 



GTGTAGAGAGGATAA 
GTGTAGAAAGGATAA 

AGAGGATAAAACGGC 
AGAGGATGAAACGGC 



(SEQ ID NO: 33) and its complement, 
(SEQ ID NO: 34) and its complement, 

(SEQ ID NO: 35) and its complement, 
(SEQ ID NO: 36) and its con^>lement. 



TGAAGTTATGGAAAC (SEQ ID NO: 
TGAAGTTGTGGAAAC (SEQ ID NO: 



TGAAGTTATGGAAAC 
TGAAGTTGTGGAAAC 

GTATGATAATCTAAA 
GTATGATCATCTAAA 



37) and its complement, 

38) and its complement, 

(SEQ ID NO: 3 9) and its complement, 
(SEQ ID NO: 40) and its complement, 

(SEQ ID NO: 41) and its complement, 
(SEQ ID NO: 42) and its complement. 



TAGTTACGGCAATTA SEQ ID NO: 43) and its complement, 
TAGTTACAGCAATTA SEQ ID NO: 44) and its complement, 

AGAGTGATGTGTCTT SEQ ID NO: 45) and its complement, 
AGAGTGACGTGTCTT SEQ ID NO: 46) and its coniplement, 

GGCATTGGATGAATAT SEQ ID NO: 47) and its complement, 
GGCATTGTATGAATAT SEQ ID NO: 48) and its complement. 



AAACAGCGTGCAGCG 
AAACAGCATGCAGCG 



(SEQ ID NO: 49) and its con^lement, 
(SEQ ID NO: 50) and its complement. 



AAAGAAGACCATCAG SEQ ID NO: 51) and its complement, 
AAAGAAGTCCATCAG SEQ ID NO: 52) and its complement. 



TCAGAAGTTATTTTT 
TCA6AAGCTATTTTT 



(SEQ ID NO: 53). and its coitplement, and 
(SEQ ID NO: 54) and its complement. 



An allele-specific oligonucleotide primer of Ae invention has a 3' tenninal nucleotide, or 
preferably a 3' penultimate nucleotide, that is complementary to only one nucleotide of a particular SNP, 
thereby acting as a primer for polymerase-mediated extension only if the allele containing that nucleotide 
is present. Allelc-specific oligonucleotide primers hybridizing to either the coding or noncoding strand 
are contemplated by the invention. A preferred ASO primer for detecting TNFRSFl IB gene 
polymorphisms comprises a nucleotide sequence, listed 5' to 3', selected from the group consisting of: 



Accession No. :AB008821 

ATGTAAACTTGAAGA (SEQ ID NO : 55) 

ATGTAAACTTGAAAA (SEQ ID NO: 57) 

CGTCCGGATCTTGGC (SEQ ID NO: 59) 

CGTCCGGATCTTGTC (SEQ ID NO: 61) 

CAGACACCACCGCCC (SEQ ID NO: 63) 

CAGACACCACCGCTC (SEQ ID NO: 65) 



TTCGCAATCATTCATCT (SEQ ID NO: 56) 
TTCGCAATCATTCATTT (SEQ ID NO: 58) 

GAGTCCGATCCAGCC (SEQ ID NO: 60) ; 
GAGTCCGATCCAGAC (SEQ ID NO: 62); 

GCGTGAGGGGTGGGG (SEQ ID NO: 64); 
GCGTGAGGGGTGGAG (SEQ ID NO: 66) ; 
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CCCACCTCCCTGGGG 
CCCACCTCCCTGGTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


67) ; 
69) ; 


GCGGAAAGGATCCCC 
GCGGAAAGGATCCAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


68) ; 
70) ; 


5 


CTGAAAGCGTTAATC 
CTGAAAGCGTTAACC 


(SEQ 


ID 
ID 


NO: 
NO: 


71); 
73) • 


AGAAAGCTCCAGGAT 
AGAAAGCTCCAGGGT 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO: 


72) ; 
74) ; 


10 


TAAGTCCCTGGGCCA 
TAAGT CCCTGGGCTA 


(SEQ 


ID 
ID 


NO: 
MO : 


75) ; 

77) • 
''II 


GCACCCGTCGGCTGG 
GCACCCGTCGGCTAG 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO: 


76) ; 
78) ; 


CCGGCGGGGAGAAGG 
CCGGCGGGGAGAAAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


79) ; 
81) ; 


GA6CGAGTGGAGCCT 
GAGCGAGTGGAGCTT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


80) ; 
82) ; 


15 


GGGTGTCCTTTTACG 
GGGTGTCCTTTTAAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


83) ; 
85) ; 


GGAACTTTGCAGCGT 
GGAACTTTGCAGCTT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


84) ; 
86) ; 


20 
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GTGCAAGCTGGTACG (SEQ ID 
GTGCAAGCTGGTATG (SEQ ID 


NO: 
NO: 


87) ; 
89) • 


TGCACATTGACACGT 
TGCACATTGACACAT 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO: 


88) ; 
90) : 


TTCCAAAGGACCATT 
TTCCAAAGGACCACT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO : 


91) ; 


ATTCCTCTGAGCAAT 
ATTCCTCTGAGCAGT 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO: 


92) ; 
94) : 


25 


TTGTGCAACATAATA 
TTGTGCAACATAACA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO : 


95) ; 

97) • 
^'11 


TTTTACTGCTACTAT 
TTTTACTGCTACTGT 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO : 


96) ; 
98) • 




GCTTGGTATTTTCCG 
GCTTGGTATTTTCTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
WO • 


99) ; 


CTGGGGTTCCTACGG 
CTGGGGTTCCTACRG 


(SEQ 
(SEO 


ID 
ID 


NO: 
NO: 


100) ; 
102) • 


30 


ACTTTGCATTTTAGC 
ACTTTGCATTTTAAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 


103) ; 


AAGATAAAATATGCT 
AAGATAAAATATGTT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


104) ; 
106) ; 


35 


GCACCAAAGTAAACG 
GCACCAAAGTAAATG 


(SEQ 
(SEQ 


ID 
ID 


NO: 


107) ; 


CTACACTCTCTGCGT 
CTACACTCTCTGCAT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


108) ; 
110) ; 


CAGAGAGTGTAGAGA 
CAGAGAGTGTAGAAA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO : 


111) 
113 ) 


CGCGTTTTATCCT (SEQ ID NO: 112) 
CGCGTTTTATCTT (SEQ ID NO: 114) 


40 


GTGTAGAGAGGATAA 
GTGTAGAGAGGATGA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO • 


115) ; 


TGTGTTGCCGTTTTA 
TGTGTTGCCGTTTCA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


116) ; 
118) ; 




AGCTGCTGAAGTTAT 
AGCTGCTGAAGTTGT 


(SEQ 
(SEQ 


ID 
ID 


NO: 

MO • 


119) ; 


TTTGATGTTTCCATA 
TTTGATGTTTCCACA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


120) ; 
122) ; 


45 


TCCAAGGTATGATAA 
TCCAAGGTATGATCA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO • 


123) ; 
125) • 


TTTTATTTTAGATTA 
TTTTATTTTAGATGA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


124) ; 
126) ; 


50 


AAAGGCTAGTTACGG 
AAAGGCTAGTTACAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


127) ; 
129) • 


TGATAAGTTAATTGC 
TGATAAGTTAATTTC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


128) 
130) 


AAACTGAGAGTGATG 
AAACTGAGAGTGACG 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


131) ; 
133) ; 


GAAAATAAGACACAT 
GAAAATAAGACACGT 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


132) 
134) 


55 


TATGATGGCATTGGA 
TATGATGGCATTGTA 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


135) ; 
137) ; 


CATTTATATTCATCC 
CATTTATATTCATAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 
NO: 


136) 
138) 




CTGTGAAAACAGCGT 


(SEQ 


ID 


NO: 


139) ; 


ATGTGCCGCTGCACG 


(SEQ 


ID 


NO: 


140) ; 
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CTGTGAAAACAGCAT 



(SEQ ID NO: 141); ATGTGCCGCTGCATG (SEQ ID NO: 142); 



GAGTCTAAAGAAGAC 
GAGTCTAAAGAAGTC 



(SEQ ID NO: 143); AGGAACCTGATGGTC (SEQ ID NO: 144); 
(SEQ ID NO: 145); AGGAACCTGATGGAC (SEQ ID NO: 146); 



5 



ATTGTATCAGAAGTT (SEQ ID NO: 147); ATTTCTAAAAATAAC (SEQ ID NO: 148); 
ATTGTATCAGAAGCT (SEQ ID NO: 149); and ATTTCTAAAAATAGC (SEQ ID NO: 150). 

Other genotyping oligonucleotides of the invention hybridize to a target region located one to 
10 several nucleotides downstream of one of the novel polymorphic sites identified herein. Such 

oligonucleotides are useful in polymerase-mediated primer extension methods for detecting one of the 
novel polymorphisms described herein and therefore such genotyping oligonucleotides are referred to 
herein as "primer-extension oligonucleotides". In a preferred embodiment, the 3 '-terminus of a primer- 
extension oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately 
15 adjacent to the polymorphic site. A particularly preferred oligonucleotide primer for detecting 

TNFRSFl IB gene polymorphisms by primer extension terminates in a nucleotide sequence, listed 5' to 
3', selected from the group consisting of: 
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TAAACTTGAA 


(SEQ 


ID 


NO: 


151) 


AATCATTCAT 


(SEQ 


ID 


NO: 


152) ; 


CCGGATCTTG 


(SEQ 


ID 


NO: 


153) 


TCCGATCCAG 


(SEQ 


ID 


NO: 


154) ; 


ACACCACCGC 


(SEQ 


ID 


NO: 


155) 


TGAGGGGTGG 


(SEQ 


ID 


NO: 


156) ; 


ACCTCCCTGG 


(SEQ 


ID 


NO; 


157) , 


GAAAGGATCC 


(SEQ 


ID 


NO: 


158) ; 


AAAGCGTTAA 


(SEQ 


ID 


NO: 


159) 


AAGCTCCAGG 


(SEQ 


ID 


NO: 


160) ; 


GTCCCTGGGC 


(SEQ 


ID 


NO: 


161) 


CCCGTCGGCT 


(SEQ 


ID 


NO: 


162) ; 


GCGGGGAGAA 


(SEQ 


ID 


NO: 


163) 


CGAGTGGAGC 


(SEQ 


ID 


NO: 


164) ; 


TGTCCTTTTA 


(SEQ 


ID 


NO: 


165) , 


ACTTTGCAGC 


(SEQ 


ID 


NO: 


166) ; 
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CAAGCTGGTA 


(SEQ 


ID 


NO: 


167) 


ACATTGACAC 


(SEQ 


ID 


NO: 


168) ; 


CAAAGGACCA 


(SEQ 


ID 


NO: 


169) 


CCTCTGAGCA 


(SEQ 


ID 


NO: 


170) ; 


TGCAACATAA 


(SEQ 


ID 


NO: 


171) 


TACTGCTACT 


(SEQ 


ID 


NO: 


172) ; 


TGGTATTTTC 


(SEQ 


ID 


NO: 


173) 


GGGTTCCTAC 


(SEQ 


ID 


NO: 


174) ; 


TTGCATTTTA 


(SEQ 


ID 


NO: 


175) , 


ATAAAATATG 


(SEQ 


ID 


NO: 


176) ; 


CCAAAGTAAA 


(SEQ 


ID 


NO: 


177) 


CACTCTCTGC 


(SEQ 


ID 


NO: 


178) ; 


AGAGTGTAGA 


(SEQ 


ID 


NO: 


179) 


GTTTTATCCT 


(SEQ 


ID 


NO: 


180) 


TAGAGAGGAT 


(SEQ 


ID 


NO: 


181) 


GTTGCCGTTT 


(SEQ 


ID 


NO: 


182) ; 


TGCTGAAGTT 


(SEQ 


ID 


NO: 


183) 


GATGTTTCCA 


(SEQ 


ID 


NO: 


184) ; 


AAGGTATGAT 


(SEQ 


ID 


NO: 


185) , 


TATTTTAGAT 


(SEQ 


ID 


NO: 


186) ; 


GGCTAGTTAC . 


(SEQ 


ID 


NO: 


187) ; 


AGTTAATTGC 


(SEQ 


ID 


NO: 


188) ; 


CTGAGAGTGA 


(SEQ 


ID 


NO: 


189) 


AATAAGACAC 


(SEQ 


ID 


NO: 


190) ; 


GATGGCATTG 


(SEQ 


ID 


NO: 


191) < 


TTATATTCAT 


(SEQ 


ID 


NO: 


192) ; 


TGAAAACAGC 


(SEQ 


ID 


NO: 


193) 


TGCCGCTGCA 


(SEQ 


ID 


NO: 


194) ; 


TCTAAAGAAG 


(SEQ 


ID 


NO: 


195) , 


AACCTGATGG 


(SEQ 


ID 


NO: 


196) ; 


6TATCAGAAG 


(SEQ 


ID 


NO: 


197) , 


and TCTAAAAATA 


(SEQ ID NO 


: 198) 



45 

Jn some embodiments, a composition contains two or more differently labeled genotyping 
oligonucleotides for simultaneously probing the identity of nucleotides at two or more polymorphic sites. 
It is also contemplated that primer compositions may contain two or more sets of allele-specific primer 
pairs to allow simultaneous targeting and amplification of two or more regions contaiiung a polymorphic 
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site. 

TNFRSFl IB genotyping oligonucleotides of the invention may also be immobilized on or 
synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 and WO 
98/20019). Such immobilized genotyping oligonucleotides may be used in a variety of pqlymorphism 
S detection assays, including but not limited to probe hybridization and polymerase extension assays. 
Immobilized TNFRSFl IB genotyping oligonucleotides of the invention may comprise an ordered anay 
of oligonucleotides designed to rapidly screen a DNA sample for polymorphisms in multiple genes at the 
same time. 

In another embodiment, the invention provides a kit comprising at least two genotyping 

10 oligonucleotides packaged in separate containers. The kit may also contain other components such as 
hybridization buffer (where the oligonucleotides are to be used as a probe) packaged in a separate 
container. Alternatively, where the oligonucleotides are to be used to amplify a target region, the kit may 
contain, packaged in separate containers, a polymerase and a reaction buffer optimized for primer 
extension mediated by the polymerase, such as PCIL 

IS The above described oligonucleotide compositions and kits are useful in methods for genotyping 

and/or haplotyping the TNFRSFl IB gene in an individual. As used herein, flie terms "TNFRSFl IB 
genotype" and 'TNFRSFl IB haplotype" mean the genotype or haplotype contains the nucleotide pair or 
nucleotide, respectively, that is present at one or more of the novel polymorphic sites described herein 
and may optionally also include the nucleotide pair or nucleotide present at one or more additional 

20 polymorphic sites in the TNFRSFl IB gene. The additional polymorphic sites may be currently known 
polymorphic sites or sites that are subsequently discovered. 

One embodiment of the genotyping method involves isolating £rom the individual a nucleic acid 
mixture comprising the two copies of the TNFRSFl IB gene, or a fragment thereof, that are present in the 
individual, and determining the identity of the nucleotide pair at one or more of the polymorphic sites 

25 selected from PSl, PS2, PS3, PS4, PS5, PS7, PS8. PS9, PSIO, PSl 1. PS12, PS13, PS14, PS15, PS16, 

PS17, PS18, PS19, PS2G, PS21, PS22, PS23, PS24, and PS 25 in the two copies to assign a TNFRSFllB 
genotype to the individual. As will be readily understood by the skilled artisan, the two "copies" of a 
gene in an individual may be the same allele or may be different alleles. In a preferred embodiment of the 
genotyping method, the identity of the nucleotide pair at PS6 is also determined. In a particularly 

30 preferred embodiment, the genotyping method comprises determining the identity of the nucleotide pair 
ateachof PS1-2S. 

Typically, the nucleic acid mixture is isolated fi-om a biological sample taken fix>m the individual, 
such as a blood sample or tissue sample. Suitable tissue samples include whole blood, semen saliva, 
tears, urine, fecal material, sweat, buccal, skin and hair. The nucleic acid mixture may be con^rised of 
35 genomic DNA, mRNA, or cDNA and, in the latter two cases, the biological sample must be obtained 
from an organ in which the TNFRSFl IB gene is expressed. Furthermore it will be understood by the 
skilled artisan that mRNA or cDNA preparations would not be used to detect polymorphisms located in 
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introns or in 5 ' and 3 ' nontranscribed regions. If a TNFRSFl 1 B gene fragment is isolated, it must 
contain the polymorphic site(s) to be genotyped. 

One embodiment of the haplotyping method comprises isolating from the individual a nucleic 
acid molecule containing only one of the two copies of the TNFRSFl IB gene, or a fragment thereof, that 
S is present in the individual and determining in that copy the identity of the nucleotide at one or more of 
the polymorphic sites PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9. PSIO, PSl 1, PS12, PS13, PS14, PS15, 
PS 16, PS 17, PS 18, PS 19, PS20, PS21, PS22, PS23, PS24, and PS 25 in that copy to assign a 
TNERSFl IB haplotype to the individual. The nucleic acid may be isolated using any method capable of 
separating the two copies of the TNFRSFl IB gene or fragment such as one of the methods described 

10 above for preparing TNFRSFl IB isogenes, with targeted in vivo cloning being the preferred approach. 
As will be readily appreciated by those skilled in the art, any individual clone will only provide haplotype 
information on one of the two TNFRSFl IB gene copies present in an individual. If haplotype 
information is desired for the individual's other copy, additional TNFRSFl IB clones will need to be 
examined. Typically, at least five clones should be examined to have more than a 90% probability of 

IS haplotyping both copies of the TNFRSFl IB gene in an individual. In some embodiments, the 
haplotyping method also coiiiprises identifying the nucleotide at PS6. In a particularly preferred 
embodiment, the nucleotide at each of PS 1-25 is identified. 

In a preferred embodiment, a TNFRSFl IB haplotype pair is determined for an individual by 
identifying the phased sequence of nucleotides at one or more of the polymorphic sites selected from PS 1 , 

20 PS2, PS3, PS4, PS5, PS7, PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14. PS15, PS16. PS17, PS18, PS19, 
PS20, PS21, PS22, PS23, PS24, and PS 25 in each copy of the TNFRSFl IB gene that is present in the 
individual. In a particularly preferred embodiment, the haplotyping method comprises identifying the 
piksed sequence of nucleotides at each of PSl-25 in each copy of the TNFRSFl IB gene. When 
haplotyping both copies of the gene, the identifying step is preferably performed with each copy of the 

25 gene being placed in separate containers. However, it is also envisioned that if the two copies are labeled 
with different tags, -or are otherwise separately distinguishable or identifiable, it could be possible in some 
cases to perform the method in the same container. For example, if first and second copies of the gene 
are labeled with different first and second fluorescent dyes, respectively, and an allele-specific 
oligonucleotide labeled with yet a third different fluorescent dye is used to assay the polymorphic site(s), 

30 then detecting a combination of the first and third dyes would identify the polymorphism in the first gene 
copy while detecting a combination of the second and third dyes would identify the polymorphism in the 
second gene copy. 

In both the genotyping and haplotyping methods, the identity of a nucleotide (or nucleotide pair) 
at a polymorphic site(s) may be determined by amplifying a target region(s) containing the polymorphic 
35 site(s) directly from one or both copies of the TNFRSFl IB gene, or fragment thereof, and the sequence of 
the amplified region(s) determined by conventional methods. It will be readily appreciated by the skilled 
artisan that only one nucleotide will be detected at a polymorphic site in individuals who are homozygous 
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at that site, while two different nucleotides will be detected if the individual is heterozygous for that site. 
The polymorphism may be identified directly, known as positive-type identification, or by inference, 
referred to as negative-type identification. For example, where a SNP is known to be guanine and 
cytosine in a reference population, a site may be positively determined to be either guanine or cytosine for 
5 an individual homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at 
that site. Alternatively, the site may be negatively determined to he not guanine (and thus 
cytosine/cytosine) or not cytosine (and thus guanine/guanine). 

Li addition, the identity of the allele(s) present at any of the novel polymorphic sites described 
herein may be indirectly determined by genotyping a polymorphic site not disclosed herein that is in 

10 linkage disequilibrium with the polymorphic site that is of interest. Two sites are said to be in linkage 
disequililnium if the presence of a particular variant at one site enhances the predictability of another 
variant at the second site (Stevens, JC 1999, Mol. Diag. 4: 309-17). Polymorphic sites in linkage 
disequilibrium with the presently disclosed polymorphic sites may be located in regions of the gene or in 
other genomic regions not examined herein. Genotyping of a polymorphic site in linkage disequilibrium 

1 5 with the novel polymorphic sites described herein may be performed by, but is not limited to, any of the 
above-mentioned methods for detecting flie identity of the allele at a polymorphic site. 

The target region(s) may be amplified using any oligonucleotide-directed amplification method, 
including but not limited to polymerase chain reaction (PGR) (U.S. Patent No. 4,965, 188), ligase chain 
reaction (LCR) (Barany et al., Proc. Natl. Acad. Sci. USA 88:189-193, 1991; WO90/0 1069), and 

20 oligonucleotide ligation assay (OLA) (Landegren et al., Science 241 : 1077-1080, 1988). Ohgonucleotides 
useful as primers or probes in such methods should specifically hybridize to a region of the nucleic acid 
that contains or is adjacent to the polymorphic site. Typically, itie oligonucleotides are between 10 and 
35 nucleotides in lengA and preferably, between IS and 30 nucleotides in length. Most preferably, the 
oligonucleotides are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on 

25 many factors that are routinely considered and practiced by the skilled artisan. 

Other known nucleic acid amplification procedures may be used to amplify the target region 
including transcription-based amplification systems (U.S. Patent No. 5,130,238; EP 329,822; U.S. Patent 
No. 5,169,766, WO89/06700) and isothermal methods (Walker et al.. Proc. Natl. Acad. Sci. USA 89:392- 
396, 1992). 

30 A polymorphism in the target region may also be assayed before or after amplification using one 

of several hybridization-based methods known in the art. Typically, allele-specific oligonucleotides are 
utilized in performing such methods. The allele-specific oligonucleotides may be used as differently 
labeled probe pairs, with one member of the pair showing a perfect match to one variant of a target 
sequence and Hie other member showing a perfect match to a different variant. In some embodiments, 

35 more than one polymorphic site may be detected at once using a set of allele-specific oligonucleotides or 
oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 5°C, and 
more preferably within 2°C, of each other when hybridizing to each of the polymorphic sites being 
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detected. 

Hybridization of an allele-speciiic oligonucleotide to a target polynucleotide may be performed 
with both entities in solution, or such hybridization may be performed when either the oligonucleotide or 
the target polynucleotide is covalently or noncovalently affixed to a solid support. Attachment may be 
5 mediated, for example, by antibody-antigen interactions, poIy-L-Lys, streptavidin or avidin-biotin, salt 
bridges, hydrophobic interactions, chemical linkages, UV cross-linking baking, etc. Allele-specific 
oligonucleotides may be synthesized directly on the solid support or attached to the solid support 
subsequent to synthesis. Solid-supports suitable for use in detection methods of tiie invention include 
substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into 
10 wells (as in 96-welI plates), slides, sheets, membranes, fibers, chips, dishes, and beads. The solid support 
may be treated, coated or derivatized to facilitate the immobilization of the allele-specific oligonucleotide 
or target nucleic acid. 

The genotype or haplotype for the TNFRSFl IB gene of an individual may also be determined by 
hybridization of a nucleic sample containing one or both copies of the gene to nucleic acid arrays and 

15 subarrays such as described in WO 95/1 1995. The arrays would contain a battery of allele-specific 

oligonucleotides representing each of the polymorphic sites to be included in the genotype or haplotype. 

The identity of polymorphisms may also be determined using a mismatch detection technique, 
including but not limited to the RNase protection method using riboprobes (Winter et al., Proc. Natl. 
Acad. Sci. USA 82:7575, 1985; Meyers et al., Science 230:1242, 1985) and proteins which recognize 

20 nucleotide inismatches, such as the E. coli mutS protein (Modrich, P. Ann. Rev. Genet. 25:229-253, 
1 991). Alternatively, variant alleles can be identified by single strand conformation polymorphism 
(SSCP) analysis (Orita et al., Genomics 5:874-879, 1989; Humphries et al., in Molecular Diagnosis of 
Genetic Diseases, R. EUes, ed., pp. 321-340, 1996) or denaturing gradient gel electrophoresis (DGGE) 
(Wartell et al., Nucl. Acids Res. 18:2699-2706, 1990; Sheffield et al., Proc. Natl. Acad. Sci. USA 86:232- 

25 236, 1989). 

A polymerase-mediated primer extension method may also be used to identify the 
polymorphism(s). Several such methods have been described in the patent and scientific literature and 
include the "Genetic Bit Analysis" method (W092/15712) and the ligase/polymerase mediated genetic bit 
analysis (U.S. Patent 5,679,524. Related methods are disclosed in WO91/02087, WO90/09455, 

30 W095/1 7676, U.S. Patent Nos. 5,302,509, and 5,945,283 . Extended primers containing a polymorphism 
may be detected by mass spectrometry as described in U.S. Patent No. 5,605,798. Another primer 
extension method is allele-specific PGR (Ruailo et al., Nucl. Acids Res. 17:8392, 1989; Ruafio et al., 
Nucl. Acids Res. 19, 6877-6882, 1991; WO 93/22456; Turki et al., J. Clin. Invest. 95:1635-1641, 1995). 
In addition, multiple polymorphic sites may be investigated by simultaneously amplifying multiple 

35 regions of the nucleic acid using sets of allele-specific primers as described in Wallace et al. 
{WO89/10414). 

hi another aspect of the invention, an individual's TNFRSFl IB haplotype pair is predicted from 
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its TNFRSFl IB genotype using information on haplotype pairs known to exist in a reference population. 
In its broadest embodiment, the haplotyping prediction method comprises identifying a TNFRSFl IB 
genotype for the individual at two or more polymorphic sites selected from PSl, PS2, PS3, PS4, PS5, 
PS7, PS8, PS9, PSIO, PSll, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, 
5 PS23, PS24, and PS 25, enumerating all possible haplotype pairs which are consistent with the genotype, 
accessing data containing TNFRSFl IB haplotype pairs identified in a reference population, and assigning 
a haplotype pair to the individual that is consistent with the data. Li one embodiment, the reference 
haplotype pairs include the TNFRSFl IB haplotype pairs shown in Table 4. 

Generally, the reference population should be composed of randomly-selected individuals 
10 representing the major ethnogeographic groups of the world. A preferred reference population for use in 
the methods of the present invention comprises an approximately equal number of individuals from 
Caucasian, Afirican American, Asian and Hispanic>Latino population groups with the minimum number 
of each group being chosen based on how rare a haplotype one wants to be guaranteed to see. For 
example, if one wants to have a q% chance of not missing a haplotype that exists in the population at a 
15 p% frequency of occurring in the reference population, the number of individuals (n) who must be 

sampled is given by 2n=log(l-q)/log(l -p) where p and q are expressed as fractions. A preferred reference 
population allows the detection of any haplotype whose frequency is at least 10% with about 99% 
certainty and comprises about 20 unrelated individuals from each of the four population groups named 
above. A particularly preferred reference population includes a 3-generation family representing one or 
20 more of the four population groups to serve as controls for checking quality of haplotyping procedures. 

In a preferred embodiment, the haplotype frequency data for each ethnogeographic group is 
examined to determine whether it is consistent with Hardy-Weinberg equilibrium. Hardy-Weinberg 
equilibrium (D.L. Hartl et al.. Principles of Population Genomics, Sinauer Associates (Sunderland, MA), 
3"* Ed., 1997) postulates that the frequency of finding the haplotype pair //, / is equal to 
25 P».AHJH,)=2p{H,)p{H^) if H, * H^aad p„.AHjH^) = piH,)p{H^) if//, =H^. \ 

statistically significant difference between the observed and expected haplotype frequencies could be due 
to one or more factors including significant inbreeding in the population group, strong selective pressure 
on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from Hardy- 
Weinberg equilibrium are observed in an ethnogeographic group, the number of individuals in that group 

30 can be increased to see if the deviation is due to a sampling bias. If a larger sample size does not reduce 
the difference between observed and expected haplotype pair frequencies, then one may wish to consider 
haplotyping the individual using a direct haplotyping method such as, for example, CLASPER System™ 
technology (U.S. Patent No. 5,866,404), SMD, or allele-specific long-range PCR (Michalotos-Beloin et 
al., Nucleic Acids Res. 24:4841-4843, 1996). 

35 In one embodiment of this method for predicting a TNFRSFl IB haplotype pair, the assigning 

step involves performing the following analysis. First, each of the possible haplotype pairs is compared 
to the haplotype pairs in the reference population. Generally, only one of the haplotype pairs in the 
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reference population matches a possible haplotype pair and that pair is assigned to the individual. 
Occasionally, only one haplotype represented in the reference haplotype pairs is consistent with a possible 
haplotype pair for an individual, and in such cases the individual is assigned a haplotype pair containing 
this known haplotype and a new haplotype derived by subtracting the known haplotype from the possible 
5 haplotype pair. In rare cases, either no haplotypes in the reference population are consistent with the 
possible haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent with the 
possible haplotype pairs. In such cases, the individual is preferably haplotyped using a direct molecular 
haplotyping method such as, for example, CLASPER System™ technology (U.S. Patent No. 5,866,404), 
SMD, or allele-specific long-range PCR (Michalotos-Beloin et al., Nucleic Acids R^. 24:4841-4843, 
10 1996). 

The invention also provides a method for determining the frequency of a TNFRSFl IB genotype 
or TNFRSFl IB haplotype in a population. The method comprises determining the genotype or tiie 
haplotype pair for the TNFRSFl IB gene that is present in each member of the population, wherein the 
genotype or haplotype comprises tite nucleotide pair or nucleotide detected at one or more of the 

15 polymorphic sites PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PSIO, PSl 1. PS12, PS13, PS14, PS15, PS16, 
PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS 25 in the TNFRSFl IB gene; and calculating 
Qie frequency any particular genotype or haplotype is found in the population. The population may be a 
reference population, a family population, a same sex population, a population group, a trait population 
(e.g., a group of individuals exhibiting a trait of interest such as a medical condition or response to a 

20 therapeutic treatment). 

In another aspect of the invention, frequency data for TNFRSFl IB genotypes and/or haplotypes 
found in a reference population are used in a method for identifying an association between a trait and a 
TNFRSFl IB genotype or a TNFRSFl IB haplotype. The bait may be any detectable phenotype, 
including but not limited to susceptibility to a disease or response to a treatment. The method involves 

25 obtaining data on the frequency of the genotype(s) or haplotype(s) of interest in a reference population as 
well as in a population exhibiting the trait. Frequency data for one or both of the reference and trait 
populations may be obtained by genotyping or haplotyping each individual in the populations using one 
of the methods described above. The haplotypes for the trait population may be determined directly or, 
alternatively, by the predictive genotype to haplotype approach described above. In another embodiment, 

30 the frequency data for the reference and/or trait populations is obtained by accessing previously 

determined frequency data, which may be in written or electronic form. For example, the frequency data 
may be present in a database that is accessible by a computer. Once the frequency data is obtained, the 
frequencies of the genotype(s) or hap]otype(s) of interest in the reference and trait populations are 
compared. In a preferred embodiment, the frequencies of all genotypes and/or haplotypes observed in the 

35 populations are compared. If a particular genotype or haplotype for the TNFRSFl IB gene is more 

frequent in the trait population than in the reference population at a statistically significant amount, then 
the trait is predicted to be associated with that TNFRSFl IB genotype or haplotype. Preferably, the 
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TNFRSFl IB genotype or haplotype being compared in the trait and reference populations is selected 
from the full-genotypes and iull-haplotypes shown in Tables 4 and 5, respectively, or from sub-genotypes 
and sub-haplotypes derived from these genotypes and haplotypes. 

In a preferred embodiment of the method, the trait of interest is a clinical response exhibited by a 
S patient to some therapeutic treatment, for example, response to a drug targeting TNFRSFl IB or response 
to a therapeutic treatment for a medical condition. As used herein, "medical condition" includes but is 
not limited to any condition or disease manifested as one or more physical and/or psychological 
symptoms for which treatment is desirable, and includes previously and newly identified diseases and 
other disorders. As used herein the term "clinical response" means any or all of the following; a 

10 quantitative measure of the response, no response, and adverse response (i.e., side effects). 

In order to deduce a correlation between clinical response to a treatment and a TNFRSFl IB 
genotype or haplotype, it is necessary to obtain data on the clinical responses exhibited by a population of 
individuals who received the treatment, hereinafter the "clinical population". Hiis clinical data may be 
obtained by analyzing the results of a clinical trial that has already been run and/or &e clinical data may 

15 be obtained by designing and carrying out one or more new clinical trials. As used herein, the term 
"clinical trial" means any research study designed to collect clinical data on responses to a particular 
treatment, and includes but is not limited to phase I, phase 11 and phase III clinical trials. Standard 
methods are used to define the patient population and to enroll subjects. 

It is preferred that the individuals included in the clinical population have been graded for the 

20 existence of the medical condition of interest. This is important in cases where the symptom(s) being 
presented by the patients can be caused by more than one underlying condition, and where treatment of 
the underlying conditions are not the same. An example of this would be where patients experience 
breathing difficulties fliat are due to either asthma or respiratory infections. If both sets were treated'with 
an asthma medication, there would be a spurious group of apparent non-responders that did not actually 

25 have asthma. These people would affect the ability to detect any correlation between haplotype and 
treatment outcome. This grading of potential patients could employ a standard physical exam or one or 
more lab tests. Alternatively, grading of patients could use haplotyping for situations where there is a 
strong correlation between haplotype pair and disease susceptibility or severity. 

The therapeutic treatment of interest is administered to each individual in the trial population and 

30 each individual's response to the treatment is measured using one or more predetermined criteria. It is 
contemplated that in many cases, the trial population will exhibit a range of responses and that the 
investigator will choose the number of responder groups (e.g., low, medium, high) made up by the 
various responses. In addition, the TNFRSFllB gene for each individual in the trial population is 
genotyped and/or haplotyped, which may be done before or after administering the treatment. 

35 After both the clinical and polymorphism data have been obtained, correlations between 

individual response and TNFRSFl IB genotype or haplotype content are created. Correlations may be 
produced in several ways. In one method, individuals are grouped by their TNFRSFl IB genotype or 
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hapiotype (or haplotype pair) (also referred to as a polymorphism group), and then the averages and 
standard deviations of clinical responses exhibited by the members of each polymorphism group are 
calculated. 

These results are then analyzed to determine if any observed variation in clinical response 
S between polymorphism groups is statistically significant. Statistical analysis methods which may be used 
are described in L.D. Fisher and G. vanBelle, "Biostatistics: A Methodology for the Health Sciences", 
Wiley-Interscience (New York) 1993. This analysis may also include a regression calculation of which 
polymorphic sites in the TNFRSFl IB gene give the most significant contribution to the differences in 
phenotype. One regression model useful in the invention starts with a model of the form 
10 r = + 5 X d 

where r is the response, is a constant called the "intercept", S is the slope and d is the dose. To 

determine the dose, the most-common and least common nucleotides at the polymorphic site are first 
defined. Then, for each individual in the trial population, one calculates a "dose" as the number of least- 
common nucleotides the individual has at the polymorphic site of interest. This value can be 0 

15 (homozygous for the least-common nucleotide), 1 (heterozygous), or 2 (homozygous for the most 
common nucleotide). An individual's "response" is the value of the clinical measurement. Standard 
linear regression methods are then used to fit all the individuals' doses and responses to a single model 
(see e.g., L.D. Fisher and G. vanBelle, supra, Ch 9). The outputs of die regression calculation are the 
intercept , the slope S, and the variance (which measures how well the data fits this simple linear 

20 model). The Students t-test value and the level of significance can then be calculated for each of the 
polymorphic sites. 

A second method for finding correlations between TNFRSFl IB haplotype content and clinical 
responses uses predictive models based on error-minimizing optimization algorithms. One of many 
possible optimization algorithms is a genetic algorithm (R. Judson, "Genetic Algorithms and Their Uses 

25 in Chemistry" in Reviews in Computational Chemistry, Vol. 10, pp. 1-73, K. B. Lipkowitz and D. B. 

Boyd, eds. (VCR Publishers, New York, 1997). Simulated annealing (Press et al., "Numerical Recipes in 
C: The Art of Scientific Coniputing", Cambridge University Press (Cambridge) 1992, Ch. 10), neural 
networks (E. Rich and K. Knight, "Artificial Intelligence", 2"* Edition (McGraw-Hill, New York, 1991, 
Ch. 18), standard gradient descent methods (Press et al., supra Ch. 10), or other global or local 

30 optimization approaches (see discussion in Judson, supra) could also be used. As an example, a genetic 
algorithm approach is described herein. This method searches for optimal parameters or weights in linear 
or non-linear models connecting TNFRSFl IB haplotype loci and clinical outcome. One model is of the 
form 



35 where C is the measured clinical outcome, i goes over all polymorphic sites, a over all candidate genes, 




[1] 
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Co , w,^ and w',^ are variable weight values, R. ^ is equal to 1 if site i in gene a in the first haplotype 

takes on the most conunon nucleotide and -1 if it takes on the less common nucleotide. L/^ is the same 

as Ri g except for the second haplotype. The constant term and the weights w,^ and w',^ are varied 

by the genetic algorithm during a search process that minimizes the error between the measured value of 
S C and the value calculated from Equation 1 . Models other than the one given in Equation 1 can be readily 
incoiporated by those skilled in the art for analyzing the clinical.and polymorphism data. The genetic 
algorithm is especially suited for searching not only over the space of weights in a particular model but 
also over the space of possible models (Judson, supra). 

Correlations may also be analyzed using analysis of variation (ANOVA) techniques to determine 

10 how much of the variation in the clinical data is explained by different subsets of the polymorphic sites in 
the TNFRSFl IB gene. ANOVA is used to test hypotheses about whether a response variable is caused 
by or correlated with one or more traits or variables thait can be measured (Fisher and vanBelle, supra, Ch. 
10). These traits or variables are called the independent variables. To carry out ANOVA, the 
independent variable(s) are measured and individuals are placed into groups based on their values for 

15 these variables. In this case, the independent variable(s) refers to the combination of polymorphisms 
present at a subset of the polymorphic sites, and thus, each group contains those individuals with a given 
genotype or haplotype pair. The variation in response within the groups and also the variation between 
groups is then measured. If the within-group response variation is large (people in a group have a wide 
range of responses) and the response variation between groups is small (the average responses for all 

20 groups are about the same) then it can be concluded that the independent variables used for the grouping 
are not causing or correlated with tiie response variable. For instance, if people are grouped by month of 
birth (which should have nothing to do with their response to a drug) the ANOVA calculation should 
show a low level of significance. However, if the response variation is larger between groups than within 
groups, the F-ratio (="between groups" divided by "within groups") is greater than one. Large values of 

25 the F-ratio indicate that the independent variable is causing or correlated with the response. The 
calculated F-ratio is preferably compared with the critical F-distribution value at whatever level of 
significance is of interest. If the F-ratio is greater than the Critical F-distribution value, then one may be 
confident that the individual's genotype or haplotype pair for this particular subset of polymorphic sites in 
the TNFRSFl IB gene is at least partially responsible for, or is at least strongly corirelated with the clinical 

30 response. 

From the analyses described above, a mathematical model may be readily constructed by the 
skilled artisan that predicts clinical response as a function of TNFRSFl IB genome or haplotype content. 
Preferably, the model is validated in one or more follow-up clinical trials designed to test the model. 

The identification of an association between a clinical response and a genotype or haplotype (or 
35 haplotype pair) for the TNFRSF 1 IB gene may be the basis for designing a diagnostic method to 

determine those individuals who will or will not respond to the treatment, or alternatively, will respond at 
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a lower level and thus may require more treatment, i.e., a greater dose of a drug. The diagnostic method 
may take one of several forms: for example, a direct DNA test (i.e., genotyping or haplotyping one or 
more of the polymorphic sites in the TNFRSFl IB gene), a serological test, or a physical exam 
measurement. The only requirement is that there be a good correlation between the diagnostic test results 
5 and the underlying TNFRSFl IB genotype or haplotype that is in turn correlated with the clinical 
response. In a preferred embodiment, this diagnostic method uses the predictive haplotyping method 
described above. 

Any or all analytical and mathematical operations involved in practicing the methods of the 
present invention may be implemented by a computer. In addition, the computer may execute a program 

10 that generates views (or screens) displayed on a display device and with which the user can interact to 
view and analyze large amounts of information relating to the TNFRSFl IB gene and its genomic 
variation, including chromosome location, gene structure, and gene family, gene expression data, 
polymorphism data, genetic sequence data, and clinical data population data (e.g., data on 
ethnogeographic origin, clinical responses, genotypes, and haplotypes for one or more populations). The 

1 S TNFRSFl 1 B polymorphism data described herein may be stored as part of a relational database (e.g., an 
instance of an Oracle database or a set of ASCII flat files). These polymorphism data may be stored on 
the computer's hard drive or may, for example, be stored on a CD ROM or on one or more other storage 
devices accessible by the computer. For example, the data may be stored on one or more databases in 
communication with the computer via a network. 

20 Prefenred embodiments of the invention are described in the following examples. Other 

embodiments within the scope of the claims herein will be apparent to one skilled in the art from 
consideration of the specification or practice of the invention as disclosed herem. It is intended that the 
specification, together with the examples, be considered exemplary only, with the scope and spirit of the 
invention being indicated by the claims which follow the examples. 

25 

EXAMPLES 

The Examples herein are meant to exemplify the various aspects of carrying out the invention and 
are not intended to limit the scope of the invention in any way. The Exanqiles do not include detailed 
descriptions for conventional methods employed, such as in the performance of genomic DNA isolation, 
30 PCR and sequencing procedures. Such methods are well-known to those skilled in the art and are 

described in numerous publications, for exanq>le, Sambrook, Fritsch, and Maniatis, "Molecular Cloning: 
A Laboratory Manual", 2"" Edition, Cold Spring Harbor Laboratory Press, USA, (1989). 

Example lA 

35 This example illustrates examination of various regions of the TNFRSFl IB gene for 

polymorphic sites using DNA from Index Repository LA. 
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Amplification of Target Regions 

The following target regions of the TNFRSFl IB gene were amplified using the PCR primer pairs 
listed below, with the sequences presented in the S' to 3' direction and nucleotide positions shown for 
each region corresponding to the indicated GenBank Accession No. 

5 

Accession Number: AB008821 
Fragment 1 
Forward Primer 

599-618 GTACGGCGGAAACTCACAGC (SEQ ED NO: 199) 

10 Reverse Primer 

Complement of 1 1 90- 11 68 GC ACAGCAACTTGTTCATTGTGG (SEQ ID NO:200) 
PCR product 592 nt 

Fragment 2 
15 Forward Primer 

864-883 TCTCCCAGGGGACAGACACC (SEQ ID NO:20 1 ) 

Reverse Primer 

Complement of 1389-1367 TAAAAGGACACCCTAGGGGAAGC (SEQ ID NO:202) 
PCR product 526 nt 

20 

Fragment 3 
Forward Primer 

949-970 ATCCTGGAGCTTTCTGCACACC (SEQ ID NO:203) 

Reverse Primer 

25 Complement of 1609-1584 CTGAATCTAAGGGACCACTTCTTTGC (SEQ ID NO:204) 
PCR product 661 nt 

Accession Number: AB008822 
Fragment 4 
30 Forward Primer 

95-1 1 8 GCTAAGATGATGCCACTGTGTTCC (SEQ ID NO:205) 

Reverse Primer 

Complement of 699-677 GCCCTGTAGTGGCAAAGTATTCC (SEQ ED NO:206) 

PCR product 605 nt 

35 

Fragment 5 
Forward Primer 

4239-4262 CCTTTCCTCTCACATTTCATGAGC (SEQ ID NO:207) 
Reverse Primer 

40 Complement of 4976-4955 CAAACTTGACACTGCCCTTTGC (SEQ ID NO:208) 
PCR product 738 nt 

Fragment 6 
Forward Primer 

45 6526-6550 ACACGTAAGCTCAGTTGQTCTCTGC (SEQ ID NO:209) 
Reverse Primer 

Complement of 7 1 44-7 1 23 GAGCAGGGGTGAGAACAA AACC (SEQ ID NO:2 1 0) 
PCR product 619 nt 

50 Fragment 7 

Forward Primer 

8433-8455 CTTCCAAAGTnTGITGGATGCC (SEQ ID N0:21 1) 
Reverse Primer 

Complement of 91 19-9096 AGTTTACTCATCCATGGGATCTCG (SEQ ED NO:2 12) 

35 
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PGR product 687 nt 

These primer pairs were used in PGR reactions containing genomic DNA isolated firom 
immortalized cell lines for each member of Index Repository lA. The PGR reactions were carried out 
5 under the following conditions: 
Reaction volume 

10 X Advantage 2 Polymerase reaction buffer (Clontech) 
100 ng of human genomic DNA 
10 mM dNTP 

10 Advantage 2 Polymerase enzyme mix (Clontech) 
Forward Primer (10 |jM) 
Reverse Primer (10 yM) 
Water 

15 Amplification profile: 

94''C - 2 min. 1 cycle 

94°C - 30 sec. 

70°C-45sec. 10 cycles 

20 72'*G - 1 min. 

94''C - 30 sec. 

64''G-45sec. 35 cycles 

72''G - 1 min. 

25 Sequencing of PGR Products 

The PGR products were purified by Solid Phase Reversible Immobilization using the protocol 
developed by the Whitehead Genome Center. A detailed protocol can be found at 
http://www.genome.wi . mit.edu/sequencing/protocols/pure/SPRI_pcr.html . 

Briefly, five ^1 of carboxyl coated magnetic beads (10 mg/ml) and 60 ^l of HYB BUFFER (2.5M 

30 NaGl/20% PEG 8000) were added to each PGR reaction mixture (20 ^1). The reaction mixture was 
mixed well and incubated at room temperature (RT) for 10 min. The microtitre plate was placed on a 
magnet for 2 min and the beads washed twice with 150 ^1 of 70% EtOH. The beads were air dried for 2 
min and the DNA was eluted in 25 jil of distilled water and incubated at RT for 5 min. The beads were 
magnetically separated and the supernatant removed for testing and sequencing. 

35 The purified PGR products were sequenced in both directions using the primer sets described 

previously or those listed, in the 5 ' to 3 ' direction, below. 

Accession Number: AB008821 
Fragment 1 
40 Forward Primer 

624-643 GGGAGGGAGAGGAGAAAGGT (SEQ ID NO:213) 

Reverse Primer 

Complement of 1162-1 144 GGAAAGGTCAGGGGGTTGG (SEQ ID NO:214) 

45 Fragment 2 

Forward Primer 
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930-950 CCCAGCCCTGAAAGCGTTAAT (SEQ ID N0:215) 

Reverse Primer 

Complement of 1 353-1334 AAAGCGGTTrCCTGCTCCAG (SEQ ID NO:216) 

S Fragment 3 
Forward Primer 

983-1 002 CCGCCCAAGCTTCCTAAAAA (SEQ ID NO:2 1 7) 
Reverse Primer 

Complement of 1457-1436 TCTCCCTCTCTCTCGCTGTCTG (SEQ ID NO:2 1 8) 

10 

Accession Number: AB008822 
Fragment 4 
Forward Primer 

151-170 AGTGGACCACCCAGGAAACG (SEQ ID NO:219) 

15 Reverse Primer 

Complement of 67 1-652 GCAATGGTCCTTTGGAAGCA (SEQ ID NO:220) 

Fragment 5 
Forward Primer 

20 4314-4333 AACTGGCAAAGGGGATGATG (SEQ ID NO:221) 
Reverse Primer 

Complement of 4843-4822 GCATCGAGAGTAGCCTTAGCTG (SEQ ID NO:222) 

. Fragment 6 
25 Forward Primer 

6558-6577 ACCAGCCAACAGAAGCTTGA (SEQ ID NO:223) 
Reverse Primer 

Complement of 7069-7048 GTCCAACAATGATTCCAACAGG (SEQ ID NO:224) 

30 Fragment 7 

Forward Primer 

8520-8541 GGTGTCACTTAACTCCCTCTCA (SEQ ID NO:225) 
Reverse Primer 

Complement of 9039-9020 CTGATTGGACCTGGTTACCT (SEQ ID NO:226) 

35 

Analvsis of Sequences for Polymorphic Sites 

Sequences were analyzed for the presence of polymorphisms using the Polyphred program 
(Nickerson et al.. Nucleic Acids Res. 14:2745-2751, 1997). The presence of a polymorphism was 
confirmed on both strands. The polymorphisms and their locations in the TNFRSFl IB gene are listed in 
40 Table 3 below. 



Example IB 

This example illustrates examination of the OCIF gene for polymorphic sites in the following 
target regions: 2000 base pairs upstream of the ATG start codon; each of the exons, including 
45 approximately 100 base pairs on either side of die exon; and 500 to 1000 base pairs downstream of the 
termination codon. 



Amplification of Target Regions 

PCR primer pairs, which were designed based on the nearly complete OCIF genomic sequence 
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reported in the Genbank database (Accession Nos. AB008821 and AB008822), are set forth below: 



10 



Accession No. AB008821 
Promoter fragment 1 
Forward primer; 

69-94 5'- CTGTAAACAATTTCAGTGGCAACCCG -3' (SEQ ID NO:227) 

Reverse primer 

427-402 5'- CCGTGCTATTCTGCATTCACTCCTTG -3' (SEQ ID NO:228) 

PGR product 359 nt product 



Promoter fragment 2 
Forward primer: 

263-288 5'.CGTAGGAAGCTCCGATACCAATAGCC-3' (SEQ ID NO:229) 

Reverse primer: 

15 868-846 5'- GGAGAGCAGGGGAAAAAAAAGCC -3* (SEQ ID NO:230) 

PGR product 606 nt 

Promoter fragment 3 
Forward primer: 

20 558-583 5'- TTGAGGTTTCAGAACCCGAAGTGAAG -3' (SEQIDNO:231) 

Reverse primer 

1069-1045 5'- CAACAGGAAGTATCGCCTGCCTTTG -3' (SEQ ID NO:232) 
PGR product 512 nt 

25 Exon 1 

Forward primer: 

945-970 5'- GTrAATCCTGGAGCTTTCTGCACACC -3' (SEQ ID NO:233) 

Reverse primer: 

1 445- 1 420 5'- TCGCTGTCTGTCTCTCTCTTGCTGTC -3' (SEQ ID NO:234) 
30 PGR product 501 nt 

Accession No. AB008822 
Exon 2 

Forward primer: 

35 91-1 14 5'- TGATGCTAAGATGATGCCAGTGTG -3' (SEQ ID NO:235) 
Reverse primer: 

686-665 5'- GGAAAGTATTGCTGTGAGCAATGG -3' (SEQ ED NO:236) 

PGR product: 598 nt 

40 Exon 3 

Forward primer: 

4435-4457 5'- CTGTGTTAAGAGGGCATCTGCTG -3* (SEQ ID NO:237) 
Reverse primer: 

4791-4770 5'- TTGACGAAGAATGTGGGTGGAG -3' (SEQ ID NO:238) 

45 PGR product: 357 nt 

Exon 4 

Forward primer: 

6549-6573 5'- GGGACTAAGAGCAGCCAACAGAAGC -3' (SEQ ID NO:239) 
50 Reverse primer: 

7 1 66-7 1 20 5'- GAGGAGGGGTGAGAAGAAAAGGTTG -3' (SEQ ID NO:240) 
PGR product: 596 nt 

Exon 5 
55 Forward primer: 
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8150-8175 5'- GGCTGTGTGTCTCCrrrAGTTCCTCG -3' (SEQ ID NO:241) 
Reverse primer: 

9263-9238 5'- CTGACAGTTGGATTAACCATTTGGGG -3' (SEQ ID NO: 242) 
PCR product: llHnt 

These primer pairs were used in PCR reactions containing genomic DNA isolated from 
immortalized cell lines for a reference population of 70 human individuals. The PCR reactions were 
carried out under the following conditions: 

Reaction volume 

10 X Advantage 2 Polymerase reaction buffer (Clontech) 
100 ng of human genomic DNA 1 
lOmMdNTP 

Advantage 2 Polymerase enzyme mix (Clontech) 
Forward Primer (10 fiM) 
Reverse Primer (10 pM) 
Water 

Amplification profile: 

94"'C - 2 min. 1 cycle 

94^ - 30 sec. 

70''C-45sec. 10 cycles 

72°C - 1 min. 

94°C-30sec. 

64''C-45sec. 35 cycles 

72°C - 1 min. 

Sequencing of PCR Products 

The PCR products were purified by Solid Phase Reversible Immobilization using the protocol 
developed by the Whitehead Genome Center. A detailed protocol can be foimd at 
httD://www.genome. wi.mit.edu/seauencing/ protocols/pure/SPRI_pcr.html. 

Briefly, carboxyl coated magnetic beads (10 mg/ml) were washed three times with wash buffer 
(0.5 M EDTA, pH 8.0). Ten fil of washed beads and 50 fil of HYB BUFFER (2.5M NaCl/20% PEG 
8000) were added to each PCR reaction mixture (50 ^1). The reaction mixture was mixed well and 
incubated at RT for 10 min. The microtitre plate was placed on a magnet for 2 min and the beads washed 
twice widi 150 ^1 of 70% EtOH. The beads were air dried for 2 min and resuspend in 20 yd of elution 
buffer (10 mM trisacetate, pH 7.8) and incubated at RT for 5 min. The beads were magnetically 
separated and the supernatant removed for testing and sequencing. 

The purified PCR products were sequenced in both directions using sequencing primers that were 

identical to the PCR primers except for the exon 5 amplification product which, due to its size, was 

sequenced using two sets of primers. The Exon 5 sequencing primer sets are set forth below: 

Accession No. AB008822 
Forward primer 8 1 50-8 1 75 

5'- GGCTGTGTGTCTCCTTTAGTTCCTCG -3' (SEQ ID NO:241) 

Reverse primer 8928-8903 
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5'- TGGGAAAGTGGTACGTCTTTOAGTGC -3' 



Forward primer 8674-8697 
5'- TTGACCTCTGTGAAAACAGCGTGC -3' 
Reverse primer 9145-9120 
5'- CTGAAAGCCTCAAGTGCCTGAGAAAC -3 



(SEQ ID NO:243) 



(SEQ ID NO:244) 

(SEQIDNO:245) 

Sequencing reactions were performed using the Big-Dye terminator kit from PE Biosystems (Foster City, 
CA) according to the manu&cturer's instructions. The sequencing products were analyzed on an ABI 477 
automated sequencer (PE Biosystems, Foster City, CA). 



Analysis of Sequences for Polymorphic Sites 

Sequeiices were analyzed for the presence of polymorphisms using the Polyphred program 
(Nickerson et al., \^ Nucleic Adds Res. 2745-2751, 1997). The presence of a polymorphism was 
15 confirmed on both the strands. The polymorphisms and their locations in the OCIF gene are Usted in 
Table 3 below. 



Table 3. Polymorphic sites Identified In the TNFRSFllB Gene 



Polymorphic Site 


Nucleotide Position 


Reference 


Variant 


Example 


Number 




Allele 


Allele 


PSl 


491 (Acc. # AB008821) 


G 


A 


IB 


PS2 


676 (Acc. # AB008821) 


G 


T 


lA 


PS3 


889(Acc.#AB00882n 


C 


T 


lA 


PS4 


916(Acc.#AB008821) 


G 


T 


lA 


PS5 


950(Acc.#AB008821) 


T 


C 


lA 


PS6* 


1181 (Acc.#AB008821) 


G 


C 


lA. IB 


PS7 


1217 (Acc. #AB00882I) 


C 


T 


lA. IB 


PS8 


1294 (Acc. #AB008821) 


G 


A 


lA. IB 


PS9 


1390 (Acc. #AB008821) 


C 


A 


lA 


PSIO 


505 (Acc. # AB008822) 


C 


T 


lA 


PSll 


668 (Acc.#AB008822) 


T 


C 


lA 


PS12 


4397 (Acc.#AB008822) 


T 


C 


lA 


PS13 


4501 (Acc. # AB008822) 


C 


T 


lA 


PS14 


6601 (Acc. # AB008822) 


G 


A 


lA 


PS15 


6824 (Acc. # AB008822) 


C 


T 


lA 


PS16 


6839 (Acc. #AB008822) 


G 


A 


lA, IB 


PS17 


6845 (Acc.#AB008822) 


A 


G 


lA 


PS18 


6893 (Acc. #AB008822) 


A 


G 


lA. IB 


PS19 


6950 (Acc. #AB008822) 


A 


C 


IB 


PS20 


8258 (Acc. # AB008822) 


G 


A 


IB 


PS21 


8391 (Acc. # AB008822) 


T 


C 


IB 


PS22 


8622 (Acc. #AB008822) 


G 


T 


IB 


PS23 


8694 (Acc. #AB008822) 


G 


A 


lA 


PS24 


8955 (Acc. #.ABG08822) 


A 


T 


IB 


PS25 


9003 (Acc. #AB008822) 


T 


C 


lA. IB 



Previously r^oited in the literature. 

20 Example 2 

This example illustrates analysis of the TNFRSFl IB polymorphisms identified in the Index 
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Repositories for human genotypes and haplotypes for all polymorphic sites except FSl and PS16. 

The different genotypes containing these polymorphisms that were observed in these reference 
populations are sho^ in Table 4 below, with flie haplotype pair indicating the combination of haplotypes 
determined for the individual iising the haplotype derivation protocol described below. In Table 4, 
homozygous positions are indicated by one nucleotide and heterozygous positions are indicated by two 
nucleotides. 
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The haplotype pairs shown in Table 4 were estimated from the unphased genotypes using an 
extension of Clark's algorithm (Clark, A.G. (1990) MolBio Evol 7, 1 1 1-122), as described in U.S. 
Provisional Patent Application filed April 19, 2000 and entitled "A Method and System for Determining 
5 Haplotypes from a Collection of Polymorphisms". In this method, haplotypes are assigned directly from 
individuals who are homozygous at ail sites or hetero2ygous at no more than one of the variable sites. 
This list of haplotypes is then used to deconvolute the unphased genotypes in the remaining (multiply 
heterozygous) individuals. 

By following this protocol, it was determined that the Index Repositories examined herein and, 
10 by extension, the general population contains the 27 human TNFRSFI IB haplotypes shown in Table 5 
below. 
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In view of the above, it will be seen that the several advantages of the invention are achieved and 
other advantageous results attained. 

As various changes could be made in the above methods and compositions without departing from 
the scope of the invention, it is intended that all matter contained in the above description and shown in 
the accompanying drawings shall be interpreted as illustrative and not in a limiting sense. 

All references cited in this specification, including patents and patent applications, are hereby 
incorporated in their entirety by reference. The discussion of references herein is intended merely to 
summarize the assertions made by their authors and no admission is made that any reference constitutes 
prior art. Applicants reserve the right to challenge the accuracy and pertinency of the cited references. 
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What is Claimed is: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence which is a polymorphic variant of a reference sequence for 
Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) gene or a fragment thereof, wherein the 
reference sequence comprises SEQ ID N0S:1- 2, and the polymorphic variant comprises at least 
one polymorphism selected fi-om the group consisting of adenine at PSl, thymine at PS2, thymine 
at PS3, thymine at PS4, cytosine at PS5, thymine at PS7, adenine at PS8, adenine at PS9, thymine 
at PSIO, cytosine at PSll, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15, 
adenine at PS 16, guanine at PS 17, guanine at PS 1 8, cytosine at PS 1 9, adenine at PS20, cytosine at 
PS2 1 , thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS25; and 

(b) a second nucleotide sequence which is complementary to the first nucleotide sequence. 

2. The isolated polynucleotide of claim 1 which comprises a TNFRSFl IB isogene. 

3. The isolated polynucleotide of claim 1 which is a DNA molecule and comprises both the first and 
second nucleotide sequences and further comprises expression regulatory elements operably linked 
to the first nucleotide sequence. 

4. A recombinant organism transformed or transfected with the isolated polynucleotide of claim I, 

wherein the organism expresses a TNFRSFl IB protein encoded by the first nucleotide sequence. 

5. The recombinant organism of claim 4 which is a nonhuman transgenic animal. 

6. The isolated polynucleotide of claim 1, wherein the first nucleotide sequence is a polymorphic variant 

of a fragment of the TNFRSFl IB gene, the fragment comprising one or more polymorphisms 
selected from the group consisting of adenine at PSl, thymine at PS2, thymine at PS3, thymine at 
PS4, cytosine at PS5, thymine at PS7, adenine at PS8, adenine at PS9, thymine at PSIO, cytosine at 
PSl 1, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15, adenine at PS16, 
guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, cytosine at PS21, thymine at 
PS22, adenine at PS23, thymine at PS24 and cytosine at PS25. 

7. An isolated polynucleotide comprising a nucleotide sequence which is a polymorphic variant of a 
reference sequence for the TNFRSFl IB cDNA or a fragment thereof, wherein the reference 
sequence comprises SEQ ID N0:3 and the polymorphic variant comprises at least one polymorphism 
selected from the group consisting of thymine at a position corresponding to nucleotide 699, adenine 
at a position corresponding to nucleotide 714, guanine at a position corresponding to nucleotide 720, 
guanine at a position corresponding to nucleotide 768, adenine at a position corresponding to 
nucleotide 841, thymine at a position corresponding to nucleotide 1 102 and cytosine at a position 
corresponding to nucleotide 1 150. 

8. A recombinant organism transformed or transfected with the isolated polynucleotide of claim 7, 
wherein the organism expresses a Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) protein 
encoded by the polymorphic variant sequence. 

9. The recombinant organism of claim 8 which is a nonhuman transgenic animal. 
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10. An isolated polypeptide comprising an amino acid sequence which is a polymorphic variant of a 
reference sequence for the TNFRSFl IB protein or a fragment thereof, wherein the reference 
sequence comprises SEQ ID NO: 4 and the polymorphic variant comprises one or more variant 
amino acids selected from the group consisting of methionine at a position corresponding to amino 
acid position 240, methionine at a position corresponding to amino acid position 281, and serine at a 
position corresponding to amino acid 368. 

11. An isolated antibody specific for and immunoreactive with the isolated polypeptide of claim 10. 

12. A method for screening for drugs targeting the isolated polypeptide of claim 10 which comprises 
contacting the TNFRSFl IB polymorphic variant with a candidate, agent and assaying for binding 
activity. 

13. A composition comprising at least one genotyping oligonucleotide for detecting a polymorphism in 
the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) gene at a polymorphic site selected from 
PSl, PS2, PS3. PS4, PS5, PS7, PS8. PS9, PSIO. PSll, PS12. PS13. PS14, PS15, PS16. PS17. PS18. 
PS19, PS20, PS21, PS22, PS23, PS24, and PS 25. 

14. The composition of claim 13, wherein the genotyping oligonucleotide is an allele-specific 
oligonucleotide that specifically hybridizes to an allele of the TNFRSFl IB gene at a region 
containing the polymorphic site. 

15. The composition of claim 14, wherein the allele-specific oligonucleotide comprises a nucleotide 
sequence selected from the group consisting of of SEQ ID NOS:5-54, the complements of SEQ ID 
NOS: 5-54. and SEQ ID NOS:55 -150. 

16. The composition of claim 13, wherein the genotyping oligonucleotide is a primer-extension 
oligonucleotide. 

17. A method for genotyping the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) gene of an 
individual, comprising determining for the two copies of the TNFRSFl IB gene present in the 
individual the identity of the nucleotide pair at one or more polymorphic sites (PS) selected from 
PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PSIO, PSl 1, PS12. PS13, PS14, PS15. PS16, PS17. PS18, 
PS 19, PS20, PS21, PS22, PS23, PS24, and PS 25. 

18. The method of claim 1 7, wherein the determining step comprises: 

(a) isolating from the individual a nucleic acid mixture comprising both copies of the TNFRSFl IB 
gene, or a fragment thereof, that are present in the individual; 

(b) amplifying from the nucleic acid mixture a target region containing at least one of the 
polymorphic sites; 

(c) hybridizing a primer extension oligonucleotide to one allele of the amplified target region; 

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized 
genotyping oligonucleotide in the presence of at least two different terminators of the reaction, 
wherein said terminators are complementary to the alternative nucleotides present at the 
polymorphic site; and 
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(e) detecting the presence and identity of the temunator in the extended genotyping 
oligonucleotide. 

19. A method for haplotyping the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) gene of an 
individual which comprises determining, for one copy of the TNFRSFl IB gene present in the 
individual, the identity of the nucleotide at one or more polymorphic sites (PS) selected from FSl, 
PS2. PS3, PS4, PS5, PS7, PS8, PS9, PS 10, PSl 1, PS12, PS13, PS14, PS15, PS16, PS17, PS18. 
PS19, PS20, PS21, PS22. PS23, PS24, and PS 25. 

20. The method of claim 19, wherein the determining step comprises 

(a) isolating from the individual a nucleic acid molecule containing only one of the two copies of 
the TNFRSFl IB gene, or a fragment thereof, that is present in the individual; 

(b) amplifying from the nucleic acid molecule a target region containing at least one of the 
polymorphic sites; 

(c) hybridi2dng a primer extension oligonucleotide to one allele of the amplified target region; 

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized 
genotyping oligonucleotide in the presence of at least two different terminators of the reaction, 
wherein said terminators are complementary to the alternative nucleotides present at the 
polymorphic site; and 

(e) detecting the presence and identity of the terminator in the extended genotyping 
oligonucleotide. 

21. A method for predicting a haplotype pair for the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) 
gene of an individual comprising: 

(a) identifying an TNFRSFl IB genotype for the individual at two or more of polymorphic sites 
selected from PSl, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PSIO, PSll, PS12, PS13, PS14, PS15, 
PS16, PS17, PS18, PS19. PS20. PS21, PS22, PS23, PS24, and PS 25; 

(b) enumerating all possible haplotype pairs which are consistent with the genotype; 

(c) accessing data containing the TNFRSFl IB haplotype pairs determined in a reference 
population; and 

(d) assigning a haplotype pair to the individual that is consistent with the data. 

22. A method for identifying an association between a toait and at least one genotype or haplotype of the 

Osteoclastogenesis Inhibitory Factor gene which comprises comparing the frequency of the genotype 

or haplotype in a population exhibiting the trait with the frequency of the genotype or haplotype in a ' 

reference population, wherein the genotype or haplotype comprises a nucleotide pair or nucleotide 

located at one or more polymorphic sites selected from PSl, PS2, PS3, PS4, PSS, PS7, PSS, PS9, 

PSIO, PSll, PS12, PS13, PS14, PS15,PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, 

and PS 25, wherein a higher frequency of the genotype or haplotype in the trait population than in 

the reference population indicates the trait is associated with the genotype or haplotype. 

23 The method of claim 22, wherein the haplotype is selected from haplotype numbers 1 -27 shovm in 
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Table 5. 

24. The method of claim 23, wherein the trait is a clinical response to a drug targeting TNFRSFl IB . 

25. A computer system for storing and analyzing polymorphism data for the Osteoclastogenesis 
Inhibitory Factor gene, comprising: 

(a) a central processing unit (CPU); 
5 (b) a communication interface; 

(c) a display device; 

(d) an input device; and 

(e) a database containing the polymorphism data; 

wherein the polymorphism data comprises the genotypes and haplotype pairs shown in Table 4 and 
10 the haplotypes shown in Table S. 

26. A genome anthology for the Osteoclastogenesis Inhibitory Factor (TNFRSFl IB) gene which 
comprises TNFRSFl IB isogenes defined by haplotypes 1-27 shown in Table S. 
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POLYMORPHISMS IN THE TNFRSFllB GENE (Accession No. AB008821) 



CTGGAGACAT ATAACTTGAA CACTTGGCCC TGATGGGGAA GCAGCTCTGC 
AGGGACTTTT TCAGCCATCT GTAAACAATT TCAGTGGCAA CCCXXIGAACT 100 
GTAATCCATG AATGGGACCA CACTTTACAA GTCATCAAGT CTAACTTCTA 
GACCAGGGAA TTAATGGGGG AGACAGCGAA CCCTAGAGCA AAGTGCCAAA 200 
CTTCTGTCGA TAGCTTGAGG CTAGTGGAAA GACCTCGAGG AGGCTACTCC 
AGAAGTTCAG CGCGTAGGAA GCTCCGATAC CAATAGCCCT TTGATGATGG 300 
TGGGGTTGGT GAAGGGAACA GTGCTCCGCA AGGTTATCCC TGCCCCAGGC 
AGTCCAATTT TCACTCTGCA GATTCTCTCT GGCTCTAACT ACCCCAGATA 400 
ACAAGGAGTG AATGCAGAAT AGCACGGGCT TTAGGGCCAA TCTIGACATTA 
GTTAGAAAAA TTCCTACTAC ATGGTTTATG TAAACTTGAA GATGAATGAT 500 

A 

TGCGAACTCC CCGAAAAGGG CTCAGACAAT GCCATGCATA AAGAGGGGCC 
CTGTAATTTG AGGTTTCAGA ACCCGAAGTG AAGGGGTCAG GCAGCCGGGT 600 
ACGGCGGAAA CTCACAGCTT TCGCCCAGCG AGAGGACAAA GGTCTGGGAC 
ACACTCCAAC TGCGTCCGGA TCTTGQCTGG ATCGGACTCT CAGGGTGGAG 700 

T 

GAGACACAAG CACAGCAGCT GCCCAGCGTG TGCCCAGCCC TCCCACCGCT 
GGTCCCGGCT GCCAGGAGGC TGGCCGCTGG CGGGAAGGGG CCGGGAAACC 800 
TCAGAGCCCC GCGGAGACAG CAGCCXSCCTT GTTCCTCAGC CCGGTGGCTT 
TTTTTTCCCC TGCTCTCCCA GGGGACAGAC ACCACCGCCC CACCCCTCAC 900 

T 

GCCCCACCTC CCTGGGGGAT CCTTTCCGCC CCAGCCCTGA AAGCGTTAAT 
T C 
CCTGGAGCTT TCTGCACACC CCCCGACCGC TCCCGCCCAA GCTTCCTAAA 1000 
AAAGAAAGGT GCAAAGTTTG GTCCAGGATA GAAAAATGAC TGATCAAAGG 
CAGGCGATAC TTCCTGTTGC CGGGACGCTA TATATAACGT GATGAGCGCA 1100 
CGGGCTGCGG AGACGCACCG GAGCGCTCGC CCAGCCGCCG CCTCCAAGCC 
CCTGAGGTTT CCGGGGACCA C M^TG AACAA GTTGCTGTGC TGCGCGrCTC^ 1200 

c" 

TGGTAAGTCC CTGGGCCAGC CGACGGGTGC CCGGCGCCTG GGGAGGCTGC 
T 

TGCCACCTGG TCTCCCAACC TCCCAGCGGA CCGGCGGGGA GAAGGCTCCA 13 00 

A 

CTCGCTCCCT CCCAGGAGAG GCTTGGGGTT AGGCTGGAGC AGGAAACCGC 
TTTCAAGTTA TGCCATGCTT CCCCTAGGGT GTCCTTTTAC GCTGCAAAGT 1400 

A 

TCCTGCTGAC TTTATGGAAG ACAGCAAGAG AGAGACAGAC AGCGAGAGAG 
AGGGAGAGAG AGAGAGAGAG AAACTTGTTT GAAAGTTTTA GTCATTAACC 1500 
TTCTGTCTTG ATCTCAGAAT ATTAACGCCC TCATGTAGTC CATACTATCT 
TTGCTTAATG AACTTGAACT TTTATTATTA GTGGCAAAGA AGTGGTCCCT 1600 
TAGATTCAGA GTAAGTTGGA AGAAGACGTT AGTCTTCTTA AAACCATTAT 
AATTAGAATA TGACATGATA GATTTTTCTA A 1681 
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POLYMORPHISMS IN THE TNFRSFllB GENE (Accession No. AB008822) 

AAGCTTACTT TGTGCCAAAT CTCATTAGGC TTAAGGTAAT ACAGGACTTT 
GAGTCAAATG ATACTGTTGC ACATAAGAAC AAACCTATTT TCATGCTAAG 100 
ATGATGCCAC TGTGTTCCTT TCTCCTTCTA GTTT Cf GgA C ATCTCCATO 
AG TGGAC CAC CCAGGAjACG" TTTCCTCCAA AGTACCTgCA TTATGACGAA 200 
.GAAACCTCTC ATCAGCT^ 

AAAACAACAC "TGfAdVGCAA"ll GTGGAAGAC "CG TCT "CCffG CCCTG _ _ _..?_93 

AC^StACTA j:ACAGACAGC"'TGGCACACCA GTGACGAGTG TCTAT ACTGC' 

A(S:cccGTCT gcaSsgagct gcagt acgtc aagcaggagt" GCJ^CGCAC 4 0 

CCACAACCGC ' GTCTC 

TCIGCTIGPiA ACi ^AGG A^ TGKicCTCCtG" GATTTGG AGT GGTGCAA GCT ' "SOi? 
([^TACGTGfc AATGTGCAGC AAAATTAATT AGGATO^TOC W^GTCSlGA 
T 

AGTTGTGACA GTTTAGGAGA ACACTTTTGT TCTGATGACA TTATAGGATA 600 
GCAAATTGCA AAGGTAATGA AACCTGCCAG GTAGGTACTA TGTGTCTGGA 
GTGCTTCCAA AGGACCATTG CTCAGAGGAA TACTTTGCCA CTACAGGGCA 700 

c 

ATTTAATGAC AAATCTCAAA TGCAGCAAAT TATTCTCTCA TGAGATGCAT 
GATGGTTTTT tTTTTTTTTT TTAAAGAAAC AAACTCAAGT TGCACTATTG 800 
ATAGTTGATC TATACCTCTA TATTTCACTT CAGCATGGAC ACCTTCAAAC 
TGCAGCACTT TTTGACAAAC ATCAGAAATG TTAATTTATA CCAAGAGAGT 900 
AATTATGCTC ATATTAATGA GACTCTGGAG TGCTAACAAT AAGCAGTTAT 
AATTAATTAT GTAAAAAATG AGAATGGTGA GGGGAATTGC ATTTCATTAT 1000 
TAAAAACAAG GCTAGTTCTT CCTTTAGCAT GGGAGCTGAG TGTTTGGGAG 
GGTAAGGACT ATAGCAGAAT CTCTTCAATG AGCTTATTCT TTATCTTAGA 1100 
CAAAACAGAT TGTCAAGCCA AGAGCAAGCA CTTGCCTATA AACCAAGTGC 
TTTCTCTTTT GCATTTTGAA CAGCATTGGT CAGGGCTCAT GTGTATTGAA 1200 
TCTTTTAAAC CAGTAACCCA CGTTTTTTTT CTGCCACATT TGCGAAGCTT 
CAGTGCAGCC TATAACTTTT CATAGCTTGA GAAAATTAAG AGTATCCACT 1300 
TACTTAGATG GAAGAAGTAA TCAGTATAGA TTCTGATGAC TCAGTTTGAA 
GCAGTGTTTC TCAACTGAAG CCCTGCTGAT ATTTTAAGAA ATATCTGGAT 1400 
TCCTAGGCTG GACTCCTTTT TGTGGGCAGC TGTCCTGCGC ATTGTAGAAT 
TTTGGCAGCA CCCCTGGACT CTAGCCACTA GATACCAATA GCAGTCCTTC 1500 
CCCCATGTQA CAGCCAAAAA TGTCTTCAGA CACTGTCAAA TGTCGCCAGG 
TGGCAAAATC ACTCCTGGTT GAGAACAGGG TCATCAATGC TAAGTATCTG 1600 
TAACTATTTT AACTCTCAAA ACTTGTGATA TACAAAGTCT AAATTATTAG 
ACGACCAATA CTTTAGGTTT AAAGGCATAC AAATGAAACA TTCAAAAATC 1700 
AAAATCTATT CTGTTTCTCA AATAGTGAAT CTTATAAAAT TAATCACAGA 
AGATGCAAAT TGCATCAGAG TCCCTTAAAA TTCCTCTTCG TATGAGTATT 1800 
TGAGGGAGGA ATTGGTGATA GTTCCTACTT TCTATTGGAT GGTACTTTGA 
GACTCAAAAG CTAAGCTAAG TTGTGTGTGT GTCAGGGTGC GGGGTGTGGA 1900 
ATCCCATCAG ATAAAAGCAA ATCCATGTAA TTCATTCAGT AAGTTGTATA 
TGTAGAAAAA TGAAAAGTGG GCTATGCAGC TTGGAAACTA GAGAATTTTG 2000 
AAAAATAATG GAAATCACAA GGATCTTTCT TAAATAAGTA AGAAAATCTG 
TTTGTAGAAT GAAGCAAGCA GGCAGCCAGA AGACTCAGAA CAAAAGTACA 2100 
CATTTTACTC TGTGTACACT GGCAGCACAG TGGGATTTAT TTACCTCTCC 
CTCCCTAAAA ACCCACACAG CGGTTCCTCT TGGGAAATAA GAGGTTTCCA 2200 
GCCCAAAGAG AAGGAAAGAC TATGTGGTGT TACTCTAAAA AGTATTTAAT 
AACCGTTTTG TTGTTGCTGT TGCTGTTTTG AAATCAGATT GTCTCCTCTC 2300 
CATATTTTAT TTACTTCATT CTGTTAATTC CTGTGGAATT ACTTAGAGCA 
AGCATGGTGA ATTCTCAACT GTAAAGCCAA ATTTCTCCAT CATTATAATT 2400 
TCACATTTTG CCTGGCAGGT TATAATTTTT ATATTTCCAC TGATAGTAAT 
AAGGTAAAAT CATTACTTAG ATGGATAGAT CTTTTTCATA AAAAGTACCA 2500 
TCAGTTATAG AGGGAAGTCA TGTTCATGTT CAGGAAGGTC ATTAGATAAA 
GCTTCTGAAT ATATTATGAA ACATTAGTTC TGTCATTCTT AGATTCTTTT 2600 
TGTTAAATAA CTTTAAAAGC TAACTTACCT AAAAGAT^TA TCTGACACAT 
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ATGAACTTCT 
CTGAAGAATG 
ATCTCAAGAG 
TTCAAGACCA 
CAAAAATTAG 
GAGGCTGAGA 
AGCTGAGATC 
CGTCCCTGCC 
CAGAACATAC 
CACTTATTTC 
CAAATACCTC 
ATTTAAGGAG 
AGA/i^CCCT 
ATGCTGGAAA 
TCCTGTTCTT 
TCATGTAGAA 
TTTTTTTATT 
GATTTATAAA 
CATATTTGTT 
GGATATTGAA 
TTTCATATTC 
TCAAGTGTTT 
GTTCAAGTTT 
ATCATGAAGT 
TCTGCTGTTT 
ATGTAGCTTA 
GCCACCTTCA 
TAGCATACTT 
TCT^GTAGGAA 
TTTCAAAGAA 
AGGAATAATT 
CAGCGGAGGC 
CATTTCATGA 
TACAAGGAGG 
TCTAATGAAG 

AGCAGTAAAA 
TCTGCTGGGA 
CGTAC ^SAACC 
it. 



CATTAGGATG 
AACAAGATTC 
TTTGAGAGGT 
GCCTGGCCAA 
CAGGGCATGG 
CAGGAGAATC 
CCTCTACTGC 
GCCGCCCCCG 
GGCAGTCAAC 
GAGTAAATTA 
TGCTTATGAT 
TAGGATGTAG 
CTTTGCTTTG 
TAATATTTAA 
TTTTTCTGCC 
AGAGACAGGA 
TTCTGGTTTT 
TGAAGTTTAA 
ATCTTGCATA 
GTCTAAATCT 
AGATACACTG 
GAAGGTATTT 
TTCTGCCAAT 
AAAATGCCCT 
TAAACAGTTT 
ACTGCAGGCT 
AAAGTTTATT 
AGGAGTTGCT 
CTGATTGGAA 
TGATATTACA 
GTATGAAGAA 
TGGACTAATG 
GCGTTTTGTA 
AGAAACTGGC 
TGAAAAATGA 

ACCAAGTGAA 
AACGATTTGA 
iiCAGAGCGAA" 



CAG6AGAAGA 

TTAGGCCCGG 
CAAGGCGGGC 
CATGATGAAA 
TGGTGCATGC 
TCTTGAACCC 
ACTCCAGCCT 
CCTTCCCCCC 
AAAGGGAGAC 
GCAATGAAAG 
ATTGTAGAAT 
TAGGAAAGTA 
TAAGGTGGTT 
TATGTGAAGG 
AGCCCTTTGT 
GATGAAACTA 
GGTAAAA6AT 
TAAGTTTCTG 
AGCCAGAATT 
GTTCAACTAG 
GAATGTATGA 
ATTTTTAATA 
GATTTCTTCA 
TGCAGTCACC 
AAGCAAATGG 
TACGCTTTTG 
ATAATGTTGT 
TCACAATTAG 
TTTAATGATG 
GCAGACACAC 
TATGGCTGAC 
AACACCCTAC 
GGTAACGAGA 
AAAGGGGATG 
AAATGCTAGA 

AAGTCTTTCC 
GGAGAAGGTA 
ATACAGTTTG 



CCCAAGCCAC 
CACGGTGGCT 
AGATCACCTG 
CCCTGCCTCT 
CTGCAACCCT 
TCGAGGCGGA 
GGGTGACAGA 
CT^AAAAGATT 
CTGGGTCCAG 
AATGCCATGG 
TTGATATAGA 
CTAAAAACAA 
CCTAAGATAA 
TTTTAGGCTG 
CATTTTTGCA 
GAACCAGTCC 
ACAATGAGGT 
TAGCTTTGAT 
GGCCTGTAAA 
CTTACACTAG 
TCTAGCCATG 
GCGTCTTTAG 
AATTTATCAA 
CTTCCTGAAG 
TATATCATCT 
AGTCAGCGGC 
AAATTTTTAC 
GATTCAGGAA 
CAGCATTCAA 
AGCAGTTATC 
AACACGGCCT 
CCTTCTTTCC 
AAATTGACTT 
ATGGTGGAAG 
GTTTTGTGCA 

AAAACTGTGT 
CTAAATTGCT 



AGATATGTAT 
CACATCTGTA 
AGGTCAGGAG 
ACTAAAAATA 
AGCTACTCAG 
GGTTGTGGTG 
GATGAGACTC 
CTTCTTCATG 
GTGTCCAAGT 
AATCCCTGCC 
GTTGTATCCC 
ACACACAAAC 
TGTCAGTGCA 
TGTTTTCCCC 
GGTCAATGAA 
ATTTTGCCCC 
AGGAGGTTGA 
TTTTCTCTTT 
ATCTACATAT 
ATGGAGATAT 
CGTAATATAG 
TTGTGGACTG 
ATATTTTTCC 
TTTGAACGAC 
TCCGTTTACT 
CAACTTTATT 
TTCTCAAGGT 
AGAAAGAACT 
TGGGTACTAA 
TTGATTTTCT 
TACTGCCACT 
TTTCCTCTCA 
GCATTTGCAT 
TTTTGTTCTG 
ACATAATAGT 
C 

TAAGAGGGCA 
TGGTATTTTC 



CAAAAGAT.GT .■ CCKSA TGGGTj 



rCTTC ryC^ A A.igGAGA CGTCA T^PAAGCAC 



tAACATAtGT 
ATTACATTCC 
TCTGAGTTGA 
TTCCCTTTCT 
CTAAAGCTAC 
ACCTCAAAGC 
ATAGGCAAAG 
CCAAACTGTA 
TTATTAACTA 
TGATGACAAA 
TTTGGGGGGC 
GCCTGACCAA 
CTGGGCATGG 
GGAGAATCGC 
ACCACTGCAC 



ETGGT CrCCT 
TCCGGA^CA_ 
AAAATACGTC" 
ACACAAGGCC 
TGAATCTTAA 
CACTCAGAAT 
TTGATTTTCT 
GGCAGTGTCA 
GAATTCACGT 
AAGTATATAT 
TTAGGCCAGG 
CAAGGTAGGC 
CATGGTGAAA 
TAGCAGGCAC 
TTGAACCCAG 
TCCAGTCTGG 



GTGAATCAAC 



TTTGTACGAT 
TCCAGCCACA 
CCAGCTAAGG 
CTCTCAAAAA 
CTCCTTTCAC 
AGTTTGCCAC 
TGTGTGTTAT 
TGGCAACTAA 
CATGGTGGCT 
AGATCACTTG 
CCTTGTCTCT 
TTCTAGTACC 
GAGATGGAGG 
GCAACAGAGC 



GGTGTAGAAA 

aaagga^t'g 
t^aaaatgt 

TTTGTAGTAT 
TTCTTGGTCA 
CTACTCTCGA 
CTCATCTTCT 
ACTGAAATCA 
TGAGATGAAA 
TACTTTCACG 
GAAGCAAAGT 
TACTCCTATA 
AGGTCAGGAT 
ACTAAAAATA 
AGCTACTCAG 
TTGCAGTGAG 
AAGATTTCAT 



ACAGACAAAT 
CA^ACACGA 
GGAATAGGTA 
CATCTCfCTC 
AACTTACATT 
TGCATTACTG 
CACAGATAAC 
AATCTTGCCC 
TTAGGAGAGT 
AATGTCTGTA 
GATATAAACA 
ATCCCAACAT 
TTCAAGACCA 
CAAAAATTAG 
GGCTGAGGCA 
CTGAGATTGT 
CACACACACA 
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3100 
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CACACACACA CACACACACA CATTAGAAAT GTGTACTTGG CTTTGTTACC 
TATGGTATTA GTGCATCTAT TGCATGGAAC TTCCAAGCTA CTCTGGTTGT 5500 
GTTAAGCTCT TCATTGGGTA CAGGTCACTA GTATTAAGTT CAGGTTATTC 
GGATGCATTC CACGGTAGTG ATGACAATTC ATCAGGCTAG TGTGTGTGTT 5600 
CACCTTGTCA CTCCCACCAC TAGACTAATC TCAGACCTTC ACTCAAAGAC 
ACATTACACT AAAGATGATT TGCTTTTTTG TGTTTAATCA AGCAATGGTA 5700 
TAAACCAGCT TGACTCTCCC CAAACAGTTT TTCGTACTAC AAAGAAGTTT 
ATGAAGCAGA GAAATGTGAA TTGATATATA TATGAGATTC TAACCCAGTT 5800 
CCAGCATTGT TTCATTGTGT AATTGAAATC ATAGACAAGC CATTTTAGCC 
TTTGCTTTCT TATCTAAAAA AAAAAAAAAA AAAATGAAGG AAGGGGTATT 5900 
AAAAGGAGTG ATCAAATTTT AACATTCTCT TTAATTAATT CATTTTTAAT 
TTTACTTTTT, TTCATTTATT GTGCACTTAC TATGTGGTAC TGTGCTATAG 6000 
AGGCTTTAAC ATTTATAAAA ACACTGTGAA AGTTGCTTCA GATGAATATA 
GGTAGTAGAA CGGCAGAACT AGTATTCAAA GCCAGGTCTG ATGAATCCAA 6100 
AAACAAACAC CCATTACTCC CATTTTCTGG GACATACTTA CTCTACCCAG 
ATGCTCTGGG CTTT6TAATG CCTATGTAAA TAACATAGTT TTATGTTTGG 6200 
TTATTTTCCT ATGTAATGTC TACTTATATA TCTGTATCTA TCTCTTGCTT 
TGTTTCCAAA GGTAAACTAT GTGTCTAAAT GTGGGCAAAA AATAACACAC 6300 
TATTCCAAAT TACTGTTCAA ATTCCTTTAA GTCAGTGATA ATTATTTGTT 
TTGACATTAA TCATGAAGTT CCCTGTGGGT ACTAGGTAAA CCTTTAATAG 6400 
AATGTTAATG TTTGTATTCA TTATAAGAAT TTTTGGCTGT TACTTATTTA 
CAACAATATT TCACTCTAAT TAGACATTTA CTAAACTTTC TCTTGAAAAC 6500 
AATGCCCAAA AAAGAACATT AGAAGACACG TAAGCTCAGT TGGTCTCTGC 
CACTAAGACC AGCCAACAGA AGCTTGATTT TATTCAAACT TTGCATTTTA 6600 
GCATATTTTA TCTTGGAAAA TTCAATTGTG TTGGTTTTTT GTTTTTGTTT 
A 

GTATTGAATA GACTCTCAGA A ATCCAATTG TTG AGTAAAT CTT CTGGGTT 6700 

TTCTAACCTT. TCTTTAG|ATG TTAgGC^TG" T^GGAQ®^ TTCT TCAGSTi _ 

TfGCTGfTCC"'TAGAAAGTTT ACGC GGCTTAGTGT CTTGGTAGAG" 68 0.0i 

AATTTGCCTG -GGACCAAAGT AAA'CGCAGAG AGTGTAGAGA GGATAA^CG 

' ^ . . 

(3CAACACAGC TCACAAGi^C AGACTTTCCA GCTGCTGAAG TT ATGGAAAC ..JJ.9_S 
J 4- . ■ ■i*^ * : • yri- ■• Gj 

AtCC AAA ACAA AGACCAA G AT ATAGTCAA GA AGATCATCdA AjgGTATQATA 

C 

ATCTAAAATA AAAAGATCAA TCAGAAArCA AAGACACCTA TTTATCATAA 7000 
ACCAGGAACA AGACTGCATG TATGTTTAGT TGTGTGGATC TTGTTTCCCT 
GTTGGAATCA TTGTTGGACT GAAAAAGTTT CCACCTGATA ATGTAGATGT 7100 
GATTCCACAA ACAGTTATAC AAGGTTTTGT TCTCACCCCT GCTCCCCAGT 
TTCCTTGTAA AGTATGTTGA ACACTCTAAG AGAAGAGAAA TGCATTTGAA 7200 
GGCAGGGCTG TATCTCAGGG AGTCGCTTCC AGATCCCTTA ACGCTTCTGT 
AAGCAGCCCC TCTAGACCAC CAAGGAGAAG CTCTATAACC ACTTTGTATC 7300 
TTACATTGCA CCTCTACCAA GAAGCTCTGT TGTATTTACT TGGTAATTCT 
CTCCAGGTAG GCTTTTCGTA GCTTACAAAT ATGTTCTTAT TAATCCTCAT 7400 
GATATGGCCT GCATTAAAAT TATTTTAATG GCATATGTTA TGAGAATTAA 
TGAGATAAAA TCTGAAAAGT GTTTGAGCCT CTTGTAGGAA AAAGCTAGTT 7500 
ACAGCAAAAT GTTCTCACAT CTTATAAGTT TATATAAAGA TTCTCCTTTA 
GAAATGGTGT GAGAGAGAAA CAGAGAGAGA TAGGGAGAGA AGTGTGAAAG 7600 
AATCTGAAGA AAAGGAGTTT CATCCAGTGT GGACTGTAAG CTTTACGACA 
CATGATGGAA AGAGTTCTGA CTTCAGTAAG CATTGGGAGG ACATGCTAGA 7700 
AGAAAAAGGA AGAAGAGTTT CCATAATGCA GACAGGGTCA GTGAGAAATT 
CATTCAGGTC CTCACCAGTA GTTAAATGAC TGTATAGTCT TGCACTACCC 7800 
TAAAAAACTT CAAGTATCTG AAACCGGGGC AACAGATTTT AGGAGACCAA 
CGTCTTTGAG AGCTGATTGC TTTTGCTTAT GCAAAGAGTA AACTTTTATG 7900 
TTTTGAGCAA ACCAAAAGTA TTCTTTGAAC GTATAATTAG CCCTGAAGCC 
GAAAGAAAAG AGAAAATCAG AGACCGTTAG AATTGGAAGC AACCAAATTC 8000 
CCTATTTTAT AAATGAGGAC ATTTTAACCC AGAAA6ATGA ACCGATTTGG 
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CTTAGGGCTC ACAGATACTA AGTGACTCAT GTCATTAATA GAAATGTTAG 8100 
TTCCTCCCTC TTAGGTTTGT ACCCTAGCTT ATTACTGAAA TATTCTCTAG 
GCTGTGTGTC TCCTTTAGTT CCTCGACCTC ATGTCTTTGA GTTTTCAGAT 8200 
ATCCTCCTCA TGGAGGTAGT CCTCTGGTGC TATGTGTATT CTTTAAAGGC 
TAGTTACGGC AATTAACTTA TCAACTAGCG CCTACTAATG AAACTTTGTA 8300 
A 

TTACAAAGTA GCTAACTTGA ATACTTTCCT TTTTTTCTGA AATGTTATGG 
TGGTAATTTC TCAAACTTTT TCTTAGAAAA CTGAGAGTGA TGTGTCTTAT 8400 

C 

TTTCTACTGT TAATTTTCAA AATTAGGAGC TTCTTCCAAA GTTTTGTTGG 
ATGCCAAAAA TATATAGCAT ATTATCTTAT TATAaCAAAA AATATTTATC 8500 
TCAGTTCTTA GAAATAAATG GTGTCACTTA ACTCCCTCTC AAAAGAAAAG 
GTTATCATTG AAATATAATT ATGAAATTCT GCAAGAACCT TTTGCCTCAC 8600 
GCTTGTTTTA TGATGGCATT GGATGAATAT AAATGATGTG AACACTTATC 

T 

TGGGCTTTTG CTTTATGCAG jATATTGACCT CTGTGAAAAC AGCGTGCAGC "8700| 

-" -j^ ^.-—rr- --^ ^ ^ -^.^ S.AJ'""' 

GGCACATTGG AgATQ CTAAC CTCAC CTTCG AGCAGCTTCG "tAGCTTGATG 

^^AGCTTAC CGGG AAAG^^^I^ GAAG ACg£TG l^^iAAj^CA Ay" .880^ 

jiAAGGCMGC AflACCOiGTO ACC^ATCCT _ 
GAATAA AAAA TGGCGACCAA" GACA CC"!^^ '_ 8900| 

AAGCACTGAA AQAGGT AC5^^ CTtTfcci^^ ACtCTCAeTG; ^AGWTAAS_ 

jSAAGA CCATC. AGGT T CCTTC ACAGC TTC AC AATGTA , CAAA ^TGTATCAGA" "900^ 

j • ■;i^-T " 2 " . 

AGTTA TTTTT AGAAATGATA GGTAACCAGG TCCAATGAGT ;4AAAATAAGCj 

TGCTT ATAfli C TGGAAATGGC CATTGAGCTG TTTCCTCACA ATTGGCGAGA 
TCCCATGGAT GAGTAAACTG TTTCTCAGGC ACTTGAGGCT TTCAGTGATA 
TCTTTCTCAT TACCAGTGAC TAATTTTGCC ACAGGGTACT AAAAGAAACT 
ATGATGTGGA GAAAGGACTA ACATCTCCTC CAATAAACCC caaatggtta 
ATCCAACTGT CAQATCTGGA TCGTTATCTA CTGACTATAT TTTCCCTTAT 
TACTGCTTGC AGTAATTCAA CTGGAAAtTA AAAAAAAAAA ACTAGACTCC 
ACTGGGCCTT ACTAAATATG GGAATGTCTA ACTTAAATAG CTTTGGGATT 
CCAGCTATGC TAGAGGCTTT TATTAGAAAG CCATATTTTT TTCTGTAAAA 
GTTACTAATA TATCTGTAAC ACTATTACAG TATTGCTATT TATATTCATT 
CAGATATAAG ATTTGGACAT ATTATCATCC TATAAAGAAA CGGTATGACT 
TAATTTTAGA AAGAAAATTA TATTCTGTTT ATTATGACAA ATGAAAGAGA 
AAATATATAT TTTTAATGGA AAGTTTGTAG CATTTTTCTA ATAGGTACTG 
CCATATTTTT CTGTGTGGAG TATTTTTATA ATTTTATCTG TATAAGCTGT 
AATATCATTT TATAGAAAAT GCATTATTTA GTCAATTGTT TAATGTTGGA 
AAACATATGA AATATAAATT ATCTGAATAT TAGATGCTCT GAGAAATTGA 
ATGTACCTTA TTTAAAAGAT TTTATGQTTT TATAACTATA TAAATGACAT 
TATTAAAGTT TTCAAATTA 



9100 
9200 
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ATGAACRAGT TGCTGTGCTG CGCGCTCGTG 
C 

GTGGACCACC CAGGAAACGT TTCCTCCAAA 
AAACCTCrCA TCAGCTGTTG TGTGACAAAT 
AAACAACACT GTACAGCAAA GTGGAAGACC 
CCACTACTAC ACAGACAGCT GGCACACCAG 
GCCCCGTGTG CAAGGAGCTG CAGTACGTCA 
CACAACCGCG TGTGCGAATG CAAGGAAGGG 
CTGCTTGAAA CATAGGAGCT GCCCTCCTGG 
GAACCCCAGA GCGAAATACA GTTTGCAAAA 
TCAAATGAGA CGTCATCTAA AGCACCCTGT 
TGTCTTTGGT CTCCTGCTAA CTCAGAAAGG 
TATGTTCCGG AAACAGTGAA TCAACTCAAA 
CT6TGTGAGG AGGCATTCTT CAGGTTTGCT 
TAACTGGCTT AGTGTCTTGG TAGACAATTT 

CAGAGAGTGT AGAGAGGATA AAACGGCAAC 
A G 

TTCCAGCTGC TGAAGTTATG GAAACATCAA 
6 

CAAGAAGATC ATCCAAGATA TTGACCTCTG 

ACATTGGACA TGCTAACCTC ACCTTCGAGC 
AGCTTACCGG GAAAGAAAGT GGGAGCAGAA 
GGCATGCAAA CCCAGTGACC AGATCCTGAA 
TAAAAAATGG CGACCAAGAC ACCTTGAAGG 
CACTCAAAGA CGTACCACTT TCCCAAAACT 
GACCATCAGG TTCCTTCACA GCTTCACAAT 
T 

TATTTTTAGA AATGATAGGT AACCAGGTCC 
TTATAA 



TTTCTGGACA TCTCCATTAA 

GTACCTTCAT TATGACGAAG 100 
GTCCTCCTGG TACCTACCTA 
GTGTGCGCCC CTTGCCCTGA 200 
TGACGAGTGT CTATACTGCA 
AGCAGGAGTG CAATCGCACC 300 
CGCTACCTTG AGATAGAGTT 
ATTTGGAGTG GTGCAAGCTG 400 
GATGTCCAGA TGGGTTCTTC 
AGAAAACACA CAAATTGCAG 500 
AAATGCAACA CACGACAACA 
AATGTGGAAT AGATGTTACC 600 
GTTCCTACAA AGTTTACGCC 
GCCTGGCACC AAAGTAAACG 700 

T 

ACAGCTCACA AGAACAGACT 

AACAAAGACC AAGATATAGT 800 

TGAAAACAGC GTGCAGCGGC 
A 

AGCTTCGTAG CTTGATGGAA 900 
GACATTGAAA AAACAATAAA 
GCTGCTCAGT TTGTGGCGAA 1000 
GCCTAATGCA CGCACTAAAG 
GTCACTCAGA GTCTAAAGAA 1100 
GTACAAATTG TATCAGAAGT 

C 

AATCAGTAAA AATAAGCTGC 1200 

1206 
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MNKLLCCALV FLDISIKWTT QETFPPKYLH 
N 

KQHCTAKWKT VCAPCPDHYY TDSWHTSDEC 
HNRVCECKEG RYLEIEFCLK HRSCPPGFGV 
SNETSSKAPC RKHTNCSVFG LLLTQKGNAT 
LCEEAFPRFA VPTKFTPNWL SVLVDNLPGT 

FQLLKLWKHQ NKDQDIVKKI IQDIDLCENS 

SLPGKKVGAE DIEKTIKACK PSDQILKLLS 
HSKTYHFPKT VTQSLKKTIR FLHSFTMYKL 
S 

L 



YDEETSHQLL CDKCPPGTYL 

LYCSPVCKEL QYVKQECNRT 100 
VQAGTPERNT VCKRCPDGFF 
HDNICSGNSE STQKCGIDVT 200 
KVNAESVERI KRQHSSQEQT 
cM 

VQRHIGHANL TFEQLRSLME 300 
M 

LWRIKNGDQD TLKGLMHALK 
YQKLFLEMIG NQVQSVKISC 400 

401 
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SEQaEMCE LISTING 



<110> Genaissance Pharmaceuticals, Inc. 
Chew, Anne 
Denton, R. Rex 
Ouda, Amy 

Nandabalan, Krishnan 
Stevens, J. Claiborne 

<120> Drug Target Isogenes:' Polymorphisms in the 
Osteoclastogenesls Inhibitory. Factor 



<130> MWH-OOIPCT TNFRSFllB-OCIF 

<140> TBA 

<141> 2000-07-10 

<150> 60/143,020 
<151> 1999-07-09 

<160> 245 

<170> Patentiri Ver. 2.1 



<210> 1 

<2H> 1681 

<212> DNA 

<213> Homo sapiens 

<400> i 

ctggagacat ataacttgaa cacttggccc tgatggggaa gcagctctgc agggactttt 60 

tcagccatct gtaaacaatt tcagtggcaa cccgcgaact gtaatccatg aatgggacca 120 

cactttacaa gtcatcaagt ctaacttcta gaccagggaa ttaatggggg agacagcgaa 180 

ccctagagca aagtgccaaa cttctgtcga tagcttgagg ctagtggaaa gacctcgagg 240 

aggctactcc agaagttcag cgcgtaggaa gctccgatac caatagccct ttgatgatgg 300 

tggggttggt gaagggaaca gtgctccgca aggttatccc tgccccaggc agtccaattt 360 

tcactctgca gattctctct ggctctaact accccagata acaaggagtg aatgcagaat 4 20 

agcacgggct ttagggccaa tcagacatta gttagaaaaa ttcctactac atggtttatg 480 

taaacttgaa gatgaatgat tgcgaactcc ccgaaaaggg ctcagacaat gccatgcata 540 

aagaggggcc ctgtaatttg aggtttcaga acccgaagtg aaggggtcag gcagccgggt 600 

acggcggaaa ctcacagctt tcgcccagcg agaggacaaa ggtctgggac acactccaac 660 

tgcgtccgga tcttggctgg atcggactct cagggtggag gagacacaag cacagcagct 720 

gcccagcgtg tgcccagccc tcccaccgct ggtcccggct gccaggaggc tggccgctgg 780 

cgggaagggg ccgggaaacc tcagagcccc gcggagacag cagccgcctt gttcctcagc 840 

ccggtggctt ttttttcccc tgctctccca ggggacagac accaccgccc cacccctcac 900 

gccccacctc cctgggggat cctttccgcc ccagccctga aagcgttaat cctggagctt 960 

tctgcacacc ccccgaccgc. tcccgcccaa gcttcctaaa .aaagaaaggt gcaaagtttg 1020 

gtccaggata gaaaaatgac tgatcaaagg caggcgatacttcctgttgc cgggacgcta 1080 
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tatataacgt gatgagcgca cgggctgcgg 
cctccaagcc cctgaggttt ccggggacca 
tggtaagtcc ctgggccagc cgacgggtgc 
tctcccaacc tcccagcgga ccggcgggga 
gcttggggtt aggctggagc aggaaaccgc 
gtccttttac gctgcaaagt tcctgctgac 
agcgagagag agggagagag agagagagag 
ttctgtcttc atctcagaat attaacgccc 
aacttgaact tttattatta gtggcaaaga 
agaagacgtt agtcttctta aaaccattat 
a 



agacgcaccg gagcgctcgc ccagccgccg 114 0 
caatgaacaa gttgctgtgc tgcgcgctcg 1200 
ccggcgcctg gggaggctgc tgccacctgg 1260 
gaaggctcca ctcgctccct cccaggagag 1320 
tttcaagtta tgccatgctt cccctagggt 1380 
tttatggaag acagcaagag agagacagac 1440 
aaacttgttt gaaagtttta gtcattaacc 1500 
tcatgtagtc catactatct ttgcttaatg 1560 
agtggtccct tagattcaga gtaagttgga 1620 
aattagaata tgacatgata gatttttcta 1680 

1681 



<210> 2 

<211> 9869 

<212> DNA 

<213> Homo sapiens 

<400> 2 

aagcttactt tgtgccaaat ctcattaggc 
atactgttgc acataagaac aaacctattt 
tctccttcta gtttctggac atctccatta 
agtaccttca ttatgacgaa gaaacctctc 
gtacctacct aaaacaacac tgtacagcaa 
accactacta cacagacagc tggcacacca 
gcaaggagct gcagtacgtc aagcaggagt 
gcaaggaagg gcgctacctt gagatagagt 
gatttggagt ggtgcaagct ggtacgtgtc 
aaagtcagat agttgtgaca gtttaggaga 
gcaaattgca aaggtaatga aacctgccag 
aggaccattg ctcagaggaa tactttgcca 
tgcagcaaat tattctctca tgagatgcat 
aaactcaagt tgcactattg atagttgatc 
accttcaaac tgcagcactt tttgacaaac 
aattatgctc atattaatga gactctggag 
gtaaaaaatg agaatggtga ggggaattgc 
cctttagcat gggagctgag tgtttgggag 
agcttattct ttatcttaga caaaacagat 
aaccaagtgc tttctctttt gcattttgaa 
tcttttaaac cagtaaccca cgtttttttt 
tataactttt catagcttga gaaaattaag 
tcagtataga ttctgatgac tcagtttgaa 
attttaagaa atatctggat tcctaggctg 
attgtagaat tttggcagca cccctggact 
ccccatgtga cagccaaaaa tgtcttcaga 
actcctggtt gagaacaggg tcatcaatgc 
acttgtgata tacaaagtct aaattattag 
aaatgaaaca ttcaaaaatc aaaatctatt 



ttaaggtaat acaggacttt gagtcaaatg 60 
tcatgctaag atgatgccac tgtgttcctt 120 
agtggaccac ccaggaaacg tttcctccaa 180 
atcagctgtt gtgtgacaaa tgtcctcctg 240 
agtggaagac cgtgtgcgcc ccttgccctg 300 
gtgacgagtg tctatactgc agccccgtgt 360. 
gcaatcgcac ccacaaccgc gtgtgcgaat 420 
tctgcttgaa acataggagc tgccctcctg 480 
aatgtgcagc aaaattaatt aggatcatgc 540 
aca'cttttgt tctgatgaca ttataggata 600 
gtaggtacta tgtgtctgga gtgcttccaa 660 
ctacagggca atttaatgac aaatctcaaa 720 
gatggttttt tttttttttt ttaaagaaac 780 
tatacqtcta tatttcactt cagcatggac 840 
atcagaaatg ttaatttata ccaagagagt 900 
tgctaacaat aagcagttat aattaattat 960 
atttcattat taaaaacaag gctagttctt 1020 
ggtaaggact atagcagaat ctcttcaatg 1080 
tgtcaagcca agagcaagca cttgcctata 1140 
cagcattggt cagggctcat gtgtattgaa 1200 
ctgccacatt tgcgaagctt cagtgcagcc 1260 
agtatccact tacttagatg gaagaagtaa 1320 
gcagtgtttc tcaactgaag ccctgctgat 1380 
gactcctttt tgtgggcagc tgtcctgcgc 1440 
ctagccacta gataccaata gcagtccttc 1500 
cactgtcaaa tgtcgccagg tggcaaaatc 1560 
taagtatctg taactatttt aactctcaaa 1620 
acgaccaata ctttaggttt aaaggcatac 1680 
ctgtttctca aatagtgaat cttataaaat 1740 
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taatcacaga agatgcaaat tgcatcagag tcccttaaaa ttcctcttcg tatgagtatt 1800 
tgagggagga attggtgata gttcctactt tctattggat ggtactttga gactcaaaag 1860 
ctaagctaag ttgtgtgtgt gtcagggtgc ggggtgtgga atcccatcag ataaaagcaa 1920 
atccatgtaa ttcattcagt aagttgtata tgtagaaaaa tgaaaagtgg gctatgcagc 1980 
ttggaaacta gagaattttg aaaaataatg gaaatcacaa ggatctttct taaataagta 2040 
agaaaatctg tttgtagaat gaagcaagca ggcagccaga agactcagaa caaaagtaca 2100 
cattttactc tgtgtacact ggcagcacag tgggatttat ttacctctcc ctccctaaaa 2160 
acccacacag cggttcctct tgggaaataa gaggtttcca gcccaaagag aaggaaagac 2220 
tatgtggtgt tactctaaaa agtatttaat aaccgttttg ttgttgctgt tgctgttttg 2280 
aaatcagatt gtctcctctc catattttat ttacttcatt ctgttaattc ctgtggaatt 2340 
acttagagca agcatggtga attctcaact gtaaagccaa atttctccat cattataatt 2400 
tcacattttg cctggcaggt tataattttt atatttccac tgatagtaat aaggtaaaat 2460 
cattacttag atggatagat ctttttcata aaaagtacca tcagttatag agggaagtca 2520 
tgttcatgtt caggaaggtc attagataaa gcttctgaat atattatgaa acattagttc 2580 
tgtcattctt agattctttt tgttaaataa ctttaaaagc taacttacct aaaagaaata 2640 
tctgacacat atgaacttct cattaggatg caggagaaga cccaagccac agatatgtat 2700 
ctgaagaatg aacaagattc ttaggcccgg cacggtggct cacatctgta. atctcaagag 2760 
tttgagaggt caaggcgggc agatcacctg aggtcaggag ttcaagacca gcctggccaa 2820 
catgatgaaa ccctgcctct actaaaaata caaaaattag cagggcatgg tggtgcatgc 2880 
ctgcaaccct agctactcag gaggctgaga caggagaatc tcttgaaccc tcgaggcgga 2940 
ggttgtggtg agctgagatc cctctactgc actccagcct gggtgacaga gatgagactc 3000 
cgtccctgcc gccgcccccg ccttcccccc caaaaagatt cttcttcatg cagaacatac 3060 
ggcagtcaac aaagggagac ctgggtccag gtgtccaagt cacttatttc gagtaaatta 3120 
gcaatgaaag aatgccatgg aatccctgcc caaatacctc tgcttatgat attgtagaat 3180 
ttgatataga gttgtatccc atttaaggag taggatgtag taggaaagta ctaaaaacaa 3240 
acacacaaac agaaaaccct ctttgctttg taaggtggtt cctaagataa tgtcagtgca 3300 
atgctggaaa taatatttaa tatgtgaagg ttttaggctg tgttttcccc tcctgttctt 3360 
tttttctgcc agccctttgt catttttgca ggtcaatgaa tcatgtagaa agagacagga 3420 
gatgaaacta. gaaccagtcc attttgcccc tttttttatt ttctggtttt ggtaaaagat 3480 
acaatgaggt aggaggttga gatttataaa tgaagtttaa taagtttctg tagctttgat 3540 
ttttctcttt catatttgtt atcttgcata agccagaatt ggcctgtaaa atctacatat 3600 
ggatattgaa gtctaaatct gttcaactag cttacactag atggagatat tttcatattc 3660 
agatacactg gaatgtatga tctagccatg cgtaatatag tcaagtgttt gaaggtattt 3720 
atttttaata gcgtctttag ttgtggactg gttcaagttt ttctgccaat gatttcttca 3780 
aatttatcaa atatttttcc atcatgaagt aaaatgccct tgcagtcacc cttcctgaag 3840 
tttgaacgac tctgctgttt taaacagttt aagcaaatgg tatatcatct tccgtttact 3900 
atgtagctta actgcaggct tacgcttttg agtcagcggc caactttatt gccaccttca 3960 
aaagtttatt ataatgttgt aaatttttac ttctcaaggt tagcatactt aggagttgct 4020 
tcacaattag gattcaggaa agaaagaact tcagtaggaa ctgattggaa tttaatgatg 4 080 
cagcattcaa tgggtactaa tttcaaagaa tgatattaca gcagacacac agcagttatc 4140 
ttgattttct aggaataatt gtatgaagaa tatggctgac aacacggcct tactgccact 4200 
cagcggaggc tggactaatg aacaccctac ccttctttcc tttcctctca catttcatga 4260 
gcgttttgta ggtaacgaga aaattgactt gcatttgcat tacaaggagg agaaactggc 4320 
aaaggggatg atggtggaag ttttgttctg tctaatgaag tgaaaaatga aaatgctaga 4380 
gttttgtgca acataatagt agcagtaaaa accaagtgaa aagtctttcc aaaactgtgt 4440 
taagagggca tctgctggga aacgatttga ggagaaggta ctaaattgct tggtattttc 4500 
cgtaggaacc ccagagcgaa atacagtttg caaaagatgt ccagatgggt tcttctcaaa 4560 
tgagacgtca tctaaagcac cctgtagaaa acacacaaat tgcagtgtct ttggtctcct 4 620 
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gctaactcag aaaggaaatg caacacacga 
tcaaaaatgt ggaataggta attacattcc 
catctctctc tctgagttga acacaaggcc 
ttccctttct tgaatcttaa ccagctaagg 
cactcagaat ctctcaaaaa ctcatcttct 
ctcctttcac actgaaatca aatcttgccc 
tgagatgaaa ttaggagagt ccaaactgta 
aatgtctgta ttattaacta aagtatatat 
tgatgacaaa ttaggccagg catggtggct 
caaggtaggc agatcacttg aggtcaggat 
ccttgtctct actaaaaata caaaaattag 
agctactcag ggctgaggca ggagaatcgc 
ctgagattgt accactgcac tccagtctgg 
cacacacaca cacacacaca cattagaaat 
gtgcatctat tgcatggaac ttccaagcta 
caggtcacta gtattaagtt caggttattc 
atcaggctag tgtgtgtgtt caccttgtca 
actcaaagac acattacact aaagatgatt 
taaaccagct tgactctccc caaacagttt 
gaaatgtgaa ttgatatata tatgagattc 
aattgaaatc atagacaagc cattttagcc 
aaaatgaagg aaggggtatt aaaaggagtg 
catttttaat tttacttttt ttcatttatt 
aggctttaac atttataaaa acactgtgaa 
cggcagaact agtattcaaa gccaggtctg 
cattttctgg gacatactta ctctacccag 
taacatagtt ttatgtttgg ttattttcct 
tctcttgctt tgtttccaaa ggtaaactat 
tattccaaat tactgttcaa attcctttaa 
tcatgaagtt ccctgtgggt actaggtaaa 
ttataagaat ttttggctgt tacttattta 
ctaaactttc tcttgaaaac aatgcccaaa 
tggtctctgc cactaagacc agccaacaga 
gcatatttta tcttggaaaa ttcaattgtg 
gactctcaga aatccaattg ttgagtaaat 
ttaccctgtg tgaggaggca ttcttcaggt 
ggcttagtgt cttggtagac aatttgcctg 
ggataaaacg gcaacacagc tcacaagaac 
atcaaaacaa agaccaagat atagtcaaga 
aaaagatcaa tcagaaatca aagacaccta 
tatgtttagt tgtgtggatc ttgtttccct 
ccacctgata atgtagatgt gattccacaa 
gctccccagt ttccttgtaa agtatgttga 
ggcagggctg tatctcaggg agtcgcttcc 
tctagaccac caaggagaag ctctataacc 
gaagctctgt tgtatttact tggtaattct 
atgttcttat taatcctcat gatatggcct 
tgagaattaa tgagataaaa tctgaaaagt 



caacatatgt tccggaaaca gtgaatcaac 4 680 
aaaatacgtc tttgtacgat tttgtagtat 4740 
tccagccaca ttcttggtca aacttacatt 4800 
ctactctcga tgcattactg ctaaagctac 4860 
cacagataac acctcaaagc ttgattttct 4920 
ataggcaaag ggcagtgtca agtttgccac 4980 
gaattcacgt tgtgtgttat tactttcacg 5040 
tggcaactaa gaagcaaagt gatataaaca 5100 
tactcctata atcccaacat tttggggggc 5160 
ttcaagacca gcctgaccaa catggtgaaa 5220 
ctgggcatgg tagcaggcac ttctagtacc 5280 
ttgaacccag gagatggagg ttgcagtgag 5340 
gcaacagagc aagatttcat cacacacaca 5400 
gtgtacttgg ctttgttacc tatggtatta 54 60 
ctctggttgt gttaagctct tcattgggta 5520 
ggatgcattc cacggtagtg atgacaattc 5580 
ctcccaccac tagactaatc tcagaccttc 5640 
tgcttttttg tgtttaatca agcaatggta 5700 
ttcgtactac aaagaagttt atgaagcaga 5760 
taacccagtt ccagcattgt ttcattgtgt 5820 
tttgctttct tatctaaaaa aaaaaaaaaa 5880 
atcaaatttt aacattctct ttaattaatt 5940 
gtgcacttac tatgtggtac tgtgctatag 6000 
agttgcttca gatgaatata ggtagtagaa 6060 
atgaatccaa aaacaaacac ccattactcc 6120 
atgctctggg ctttgtaatg cctatgtaaa 6180 
atgtaatgtc tacttatata tctgtatcta 6240 
gtgtctaaat gtgggcaaaa aataacacac 6300 
gtcagtgata attatttgtt ttgacattaa 6360 
cctttaatag aatgttaatg tttgtattca 6420 
caacaatatt tcactctaat tagacattta 6480 
aaagaacatt agaagacacg taagctcagt 6540 
agcttgattt tattcaaact ttgcatttta 6600 
ttggtttttt gtttttgttt gtattgaata 6660 
cttctgggtt- ttctaacctt tctttagatg 6720 
ttgctgttcc tacaaagttt acgcctaact 6780 
gcaccaaagt aaacgcagag agtgtagaga 6840 
agactttcca gctgctgaag ttatggaaac 6900 
agatcatcca aggtatgata atctaaaata 6960 
tttatcataa accaggaaca agactgcatg 7020 
gttggaatca ttgttggact gaaaaagttt 7080 
acagttatac aaggttttgt tctcacccct 7140 
acactctaag agaagagaaa tgcatttgaa 7200 
agatccctta acgcttctgt aagcagcccc 7260 
actttgtatc ttacattgca cctctaccaa 7320 
ctccaggtag gcttttcgta gcttacaaat 7380 
gcattaaaat tattttaatg gcatatgtta 7440 
gtttgagcct cttgtaggaa aaagctagtt 7500 
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acagcaaaat gttctcacat cttataagtt tatataaaga ttctccttta gaaatggtgt 7560 
gagagagaaa cagagagaga tagggagaga agtgtgaaag aatctgaaga aaaggagttt 7620 
catccagtgt ggactgtaag ctttacgaca catgatggaa agagttctga cttcagtaag 7680 
cattgggagg acatgctaga agaaaaagga agaagagttt ccataatgca gacagggtca 7740 
gtgagaaatt cattcaggtc ctcaccagta gttaaatgac tgtatagtct tgcactaccc 7800 
taaaaaactt caagtatctg aaaccggggc aacagatttt aggagaccaa cgtctttgag 7860 
agctgattgc ttttgcttat gcaaagagta aacttttatg ttttgagcaa accaaaagta 7 920 
ttctttgaac gtataattag ccctgaagcc gaaagaaaag agaaaatcag agaccgttag 7980 
aattggaagc aaccaaattc cctattttat aaatgaggac attttaaccc agaaagatga 8040 
accgatttgg cttagggctc acagatacta agtgactcat gtcattaata gaaatgttag 8100 
ttcctccctc ttaggtttgt accctagctt attactgaaa tattctctag gctgtgtgtc 8160 
tcctttagtt cctcgacctc atgtctttga gttttcagat atcctcctca tggaggtagt 8220 
cctctggtgc tatgtgtatt ctttaaaggc tagttacggc aattaactta tcaactagcg 8280 
cctactaatg aaactttgta ttacaaagta gctaacttga atactttcct . ttttttctga 8340 
aatgttatgg tggtaatttc tcaaactttt tcttagaaaa ctgagagtga tgtgtcttat 8400 
tttctactgt taattttcaa aattaggagc ttcttccaaa gttttgttgg atgccaaaaa 8460 
tatatagcat attatcttat tataacaaaa aatatttatc tcagttctta gaaataaatg 8520 
gtgtcactta actccctctc aaaagaaaag gttatcattg aaatataatt atgaaattct 8580 
gcaagaacct tttgcctcac gcttgtttta tgatggcatt ggatgaatat aaatgatgtg 8640 
aacacttatc tgggcttttg ctttatgcag atattgacct ctgtgaaaac agcgtgcagc 8700 
ggcacattgg acatgctaac ctcaccttcg agcagcttcg tagcttgatg gaaagcttac 87 60 
cgggaaagaa agtgggagca gaagacattg aaaaaacaat aaaggcatgc aaacccagtg 8820 
accagatcct gaagctgctc agtttgtggc gaataaaaaa tggcgaccaa gacaccttga 8880 
agggcctaat gcacgcacta aagcactcaa agacgtacca ctttcccaaa actgtcactc 8940 
agagtctaaa gaagaccatc aggttccttc acagcttcac aatgtacaaa ttgtatcaga 9000 
agttattttt agaaatgata ggtaaccagg tccaatcagt aaaaataagc tgcttataac 9060 
tggaaatggc cattgagctg tttcctcaca attggcgaga tcccatggat gagtaaactg 9120 
tttctcaggc acttgaggct ttcagtgata tctttctcat taccagtgac taattttgcc 9180 
acagggtact aaaagaaact atgatgtgga gaaaggacta acatctcctc caataaaccc 9240 
caaatggtta atccaactgt cagatctgga tcgttatcta ctgactatat tttcccttat 9300 
tactgcttgc agtaattcaa ctggaaatta aaaaaaaaaa actagactcc actgggcctt 9360 
actaaatatg ggaatgtcta acttaaatag ctttgggatt ccagctatgc tagaggcttt 9420 
tattagaaag ccatattttt ttctgtaaaa gttactaata tatctgtaac actattacag 9480 
tattgctatt tatattcatt cagatataag atttggacat attatcatcc tataaagaaa 9540 
cggtatgact taattttaga aagaaaatta tattctgttt attatgacaa atgaaagaga 9600 
aaatatatat ttttaatgga aagtttgtag catttttcta ataggtactg ccatattttt 9660 
ctgtgtggag tatttttata attttatctg tataagctgt aatatcattt tatagaaaat 9720 
gcattattta gtcaattgtt taatgttgga aaacatatga aatataaatt atctgaatat 9780 
tagatgctct gagaaattga . atgtacctta tttaaaagat tttatggttt tataactata 9840 
taaatgacat tattaaagtt ttcaaatta . 9869 



<210> 3 

<211> 1206 

<212> DNA 

<213> Homo sapiens 

<400> 3 



5 



wo 01/04137 



PCT/USOO/18803 



atgaacaagt tgctgtgctg cgcgctcgtg 
caggaaacgt ttcctccaaa gtaccttcat 
tgtgacaaat gtcctcctgg tacctaccta 
gtgtgcgccc cttgccctga ccactactac 
ctatactgca gccccgtgtg caaggagctg 
cacaaccgcg tgtgcgaatg caaggaaggg 
cataggagct gccctcctgg atttggagtg 
gtttgcaaaa gatgtccaga tgggttcttc 
agaaaacaca caaattgcag tgtctttggt 
cacgacaaca tatgttccgg aaacagtgaa 
ctgtgtgagg aggcattctt caggtttgct 
agtgtcttgg tagacaattt gcctggcacc 
aaacggcaac a'cagctcaca agaacagact 
aacaaagacc aagatatagt caagaagatc 
gtgcagcggc acattggaca tgctaacctc 
agcttaccgg gaaagaaagt gggagcagaa 
cccagtgacc agatcctgaa gctgctcagt 
accttgaagg gcctaatgca cgcactaaag 
gtcactcaga gtctaaagaa gaccatcagg 
tatcagaagt tatttttaga aatgataggt 
ttataa 



tttctggaca tctccattaa gtggaccacc 60 
tatgacgaag aaacctctca tcagctgttg 120 
aaacaacact gtacagcaaa gtggaagacc 180 
acagacagct ggcacaccag tgacgagtgt 240 
cagtacgtca agcaggagtg caatcgcacc 300 
cgctaccttg agatagagtt ctgcttgaaa 360 
gtgcaagctg gaaccccaga gcgaaataca 420 
tcaaatgaga cgtcatctaa agcaccctgt 480 
ctcctgctaa ctcagaaagg aaatgcaaca 540 
tcaactcaaa aatgtggaat agatgttacc 600 
gttcctacaa agtttacgcc taactggctt 660 
aaagtaaacg cagagagtgt agagaggata 720 
ttccagctgc tgaagttatg gaaacatcaa 780 
atccaagata ttgacctctg tgaaaacagc 840 
accttcgagc agcttcgtag cttgatggaa 900 
gacattgaaa aaacaataaa ggcatgcaaa 960 
ttgtggcgaa taaaaaatgg cgaccaagac 1020 
cactcaaaga cgtaccactt tcccaaaact 1080 
ttccttcaca gcttcacaat gtacaaattg 1140 
aaccaggtcc aatcagtaaa aataagctgc 1200 

1206 



<210> 4 
<211> 401 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp lie Ser lie 
1 5 10 . 15 

Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro .Pro Gly Thr 
35 40 45 •" 

Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 

Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr 
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100 



105 



110 



Leu Glu He Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe 
115 120 125 

Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 1.40 

Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys 
165 170 175 

Gly Asn Ala Thr His Asp Asn He Cys Ser Gly Asn Ser Glu Ser Thr 
180 185 190 

Gin Lys Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg He 
225 230 235 240 

Lys Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
245 250 255 

Trp Lys His Gin Asn Lys Asp Gin Asp He Val Lys Lys He He Gin 
260 265 270 

Asp He Asp Leu Cys Glu Asn Ser Val Gin Arg His He Gly His Ala 
275 280 285 

Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 
290 295 300 

Lys Lys Val Gly Ala Glu Asp He Glu Lys Thr He Lys Ala Cys Lys 
305 310 315 320 

Pro Ser Asp Gin He Leu' Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 330 335 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
340 345 350 



Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr 
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355 



360 



365 



He Arg Phe Leu His Ser Ph Thr Met Tyr Lys Leu Tyr Gin Lys Leu 

375 380 

Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
390 395 400 



Leu 



<210> 5 
<211> 15 

<212> DNA 

<213> Homo sapiens 
<400> 5 

acttgaagat gaatg 

<210> 6 
<2H> 15 
<212> DNA 

<213> Homo sapiens 
<400> 6 

acttgaaaat gaatg 

<210> 7 

<2li> 15 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gatcttggct ggatc 

<210> 8 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<460> 8 

gatGttgtct ggatc 



8 



wo 01/04137 



PCT/USOO/18803 



<210> 9 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ccaccgcccc acccc 15 



<210> 10 
<2H> 15 . 
<212> DNA • 
<213> Homo sapiens 

<400> 10 

ccaccgctcc acccc 15 



<210> 11 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 11 

tccctggggg atcct 15 



<210> 12 - 

<211> 15 • 

<212> DNA 

<213> Homo sapiens 

<400> 12 

tccctggtgg atcct 15 



<210> 13 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gcgttaatcc tggag • 15 



<210> 14 
<211> 15 
<212> DNA 
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<213> Homo sapiens 
<400> 14 

igcgttaaccc tggag 

<210> 15 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cctgggccag ccgac 

<210> 16 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 16 • 
cctgggctag ccgac 

<210> 17 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 17 

gggagaaggc tccac 

<210> 18 

<211> 15 

<212> DNA 

<2i3> Homo sapiens 

<400> 18 

gggagaaagc tccac 

<210> 19 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 19 
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ccttttacgc tgcaa 15 



<210> 20 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ccttttaagc tgcaa 15 



<210> 21 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 21 

gctggtacgt gtcaa 15 



<210> 22 
<211>15 
<212> DNA 

<213> Homo sapiens 
<400> 22 

gctggtatgt gtcaa 15 



<210> 23 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 23 

aggaccattg ct-cag 15 



<210> 24 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 24 

aggaccactg ctcag 15 
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<210> 25 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 25 

aacataatag. tagca 1^ 



<210> 26 

<2H> 15 

<212> DNA 

<213> Homo' sapiens 

<400> 26 

aacataacag tagca 15 



<210> 27 
<211> 15 
<212> DNA 
<213> Homo 

<400> 27 
tattttccgt 



sapiens 



aggaa 



<210> 28 
<211> 15 
<212> DNA 
<213> Homo 

<400> 28 
tattttctgt 



sapiens 



aggaa 



<210> 29 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cattttagca tattt 



<210> 30 
<211> 15 
<212> DNA 
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<213> Homo sapiens 
<400> 30 

cattttaaca tattt 15 



<210> 31 

<211> 15 

<212> DNA- 

<213> Homo sapiens 

<400> 31 

aagtaaacgc agaga 15 



<210> 32 

<211> 15 

.<212> DNA 

<213> Homo sapiens 

<4 00> 32 

aagtaaatgc agaga 15 



<210> 33 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gtgtagagag gataa 15 



<210> 34 
<211> 15 
<212> DNA 

<213> Homo sapiens •'■ 
<400> 34 

gtgtagaaag gataa 15 



<210> 35 
<211> 15 
<212> DNA 
<213> Homo 

<400> 35 



sapiens 



13 



wo 01/04137 



PCTAJSOO/18803 



agaggataaa acggc 



<210> 36 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 36 

agaggatgaa acggc 15 

<210> 37 
<211> 15 
<212> DNA 

<213> Homo sapiens 



<210> 38 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 38 

tgaagttgtg gaaac 15 

<210> 39 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 39 

tgaagttatg gaaac 15 

<210> 40 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 37 



tgaagttatg 



gaaac 



15 



<400> 40 



tgaagttgtg 



gaaac 



15 
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<210> 41 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gtatgataat ctaaa 15 



<210> 42 
<211> 15 
<212> DNA • 
<213> Homo sapiens 

<400> 42 

gtatgatcat ctaaa 15 



<210> 43 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 43 

tagttacggc aatta 15 



<210> 44 
<2H> 15 

<212> DNA 
<2l3>.Homo sapiens 

<400> 44 

tagttacagc aatta 15 



<210> 45 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 45 

agagtgatgt gtctt 15 



<210> 46 
<211> 15 
<212> DNA 



15 
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<213> Homo sapiens 
<400> 46 

agagtgacgt gtctt 15 



<210> 47 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ggcattggat gaatat 16 



<210> 48 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 48 

ggcattgtat gaatat 16 



<210> 4 9 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 49 

aaacagcgtg cagcg 15 



<210> 50 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 50 

aaacagcatg cagcg 15 



<210> 51 
<211> 15 
<212> DNA 
<213> Homo 

<400> 51 



sapiens 



16 
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aaagaagacc atcag 



<210> 52 

<211> 15 

<212> DNA 

<213> Homo sapiens 

.<400> 52 
aaagaagtcc atcag 



<210> 53 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 53 

tcagaagtta ttttt 



<210> 54 
<211> 15 

<212> DNA 
<213> Homo 

<400> 54 
tcagaagcta 



sapiens 



ttttt 



<210> 55 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 55 

atgtaaactt gaaga ■ 



<210> 56 
•<211> 17 
<212> DNA 
<213> Homo sapiens 

<400> 56 

ttcgcaatca ttcatct 
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<210> 57 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 57 

atgtaaactt gaaaa 15 



<210> 58 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 58 

ttcgcaatca ttcattt 17 



<210> 59 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<4D0> 59 

cgtccggatc ttggc 15 



<210> 60 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gagtccgatc cagcc 15 



<210> 61 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 61 

cgtccggatc ttgtc 15 



<210> 62 
<211> 15 
<212> DNA 
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<213> Homo sapiens 



<400> 62 



gagtccgatc cagac 



IS 



<210> 63 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 63 

cagacaccac cgccc 15 

<210> 64 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<210> 65 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 65 

cagacaccac cgctc 15 

<210> 66 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 64 



gcgtgagggg tgggg 



15 



<400> 66 



gcgtgagggg tggag 



15 



<210> 67 



<211> 15 



<212> DNA 



<213> Homo sapiens 



<400> 67 



wo 01/04137 



cccacctccc tgggg 



PCTAJSOO/18803 
15 



<210> 68 

<211> 15 

<212> DNA 

<213> Honio sapiens 

<400> 68 

gcggaaagga tcccc 15 



<210> 69 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 69 

cccacctccc tggtg 15 



<210> 70 

<211> 15 

<212> DNA 

<2r3> Homo sapiens 

<400> 70 

gcggaaagga tccac 15 



<210> 71 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 71 

ctgaaagcgt taatc -15 



<210> 72 

<211> .15 

<212> DNA 

<213> Homo sapiens 

<400> 72 

agaaagctcc aggat 15 
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<210> 73 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 73 

ctgaaagcgt taiacc 



<210> 74 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 74 

agaaagctcc agggt 



<210> 75 
<2li> 15 
<2I2> DNA 
<213> Homo 

<400> 75 
taagtccctg 



sapiens 



ggcca 



<210> 76 
<211> 15 
<212> DNA 
<213> Homo 

<400> 76 
gcacccgtcg 



sapiens 



gctgg 



<210> 77 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 77 

taagtccctg ggcta 



<210> 78 
<211> 15 
<212> DNA 
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<213> Homo sapiens 

<400> 78 

gcacccgtcg gctag 15 



<210> 79 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 79 

ccggcgggga gaagg 15 



<210> 80 
•<211> 15 , 
<212> DNA 
<213> Homo sapiens 

<400> 80 - 

gagcgagtgg agcct 15 



<210> 81 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 81 

ccggcgggga gaaag IS 



<210> 82 - 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 82 

gagcgagtgg agott 15 



<210> 83 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 83 
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gggtgtcctt ttacg 15 



<210> 84 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 84 

ggaactttgc agcgt 15 



<210> 85 

<2H> 15 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gggtgtcctt ttaag 15 



<210> 86 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 86 

ggaactttgc agctt 15 



<210> 87 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 87 

gtgcaagctg gtacg 15 



<210> 88 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 88 

tgcacattga cacgt 15 
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<210> 89 

<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 89 

gtgcaagctg gtatg 15 



<210> 90 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 90 

tgcacattga cacat 15 



<210> 91 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 91 

ttccaaagga ccatt 15 



<210> 92 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 92 

attcctctga gcaat 15 



<210> 93 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 93 

ttccaaagga ccact 15 



<210> 94 
<211> 15 
<212> DNA 
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<213> Homo sapiens 
<400> 94 

attcctctga gcagt 15 



<210> 95 
<211> 15 
<212> DMA 

<213> Homo sapiens 
<400> 95 

ttgtgcaaca taata 15 



<210> 96 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 96 

ttttactgct actat 15 



<210> 97 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 97 

ttgtgcaaca taaca 15 



<210> 98 
<211> 15 
<212> DNA 
<213> Homo 



sapiens 



<400> 98 

ttttactgct actgt 



<210> 99 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 99 



25 
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gcttggtatt ttccg ' 15 



<210> 100 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 100 

ctggggttcc tacgg 15 

<210> 101 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 101 

gcttggtatt ttctg 15. 



<210> 102 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 102 

ctggggttcc tacag 15 



<210> 103 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 103 

actttgcatt ttagc 15 



<210> 104 

<211> 15 

<;2.12> DNA 

<213> Homo sapiens 

<400> 104 

aagataaaat atgct IS 
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<210> 105 

<21I> 15 

<212> DNA 

<213> Homo sapiens 

<400> 105 

actttgcatt ttaac 15 



<210> 106 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 106 
aagataaaat atgtt 

<210> 107 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 107 
gcaccaaagt aaacg 



<210> 108 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 108 
ctacactctc tgcgt 



<210> 109 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 109 
gcaccaaagt aaatg 



<210> 110 
<211> 15 
<212> DNA 



27 



wo 01/04137 



PCT/USOO/18803 



<213> Homo sapiens 
<400> 110 

ctacactctc tgcat 15 



<210> 111 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cagagagtgt agaga 15 



<210> 112 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 112 

cgcgttttat cct 13 



<210> 113 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 113 

cagagagtgt agaaa 15 



<210> 114 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 114 

cgcgttttat ctt 13 



<210> 115 

<211> 15 

<212> DNA 

.<213> Homo sapiens 

<400> 115 



28 
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gtgtagagag gataa 15. 



<210> 116 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tgtgttgccg tttta 15 



<210> 117 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 117 

gtgtagagag gatga 15 



<210> 118 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 118 

tgtgttgccg tttca 15 



<210> 119 

<211> 15 

<212> DNA . 

<213> Homo sapiens 

<400> 119 

agctgctgaa gttat • • . 15 



<210> 120 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400>- 120 

tttgatgttt ccata 15 
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<210> 121 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 121 
agctgctgaa gttgt 

<210> 122 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 122 
tttgatgttt ccaca 



<210> 123 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 123 
tccaaggtat gataa 

<210> 124 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 124 
ttttatttta gatta 



<210> 125 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 125 
tccaaggtat gatca 



<210> 126 
<211> 15. 
<212> DNA 



30 
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<213> Homo sapiens 



PCT/USOO/18803 



<400> 126 
ttttatttta gatga 



<210> 127 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 127 
aaaggctagt tacgg 

<210> 128 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 128 
.tgataagtta attgc 

<210> 129 

<211> 15 

<212> DljlA 

<213> Homo sapiens 

<400> 129 
aaaggctagt tacag 

<210> 130 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 130 
tgataagtta atttc 



<210> 131 
<211> 15 

<212> DNA 

•<213> Homo sapiens 
<400> 131 
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aaactgagag tgatg 15 



<210> 132 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 132 

gaaaataaga cacat 15 



<210> 133 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 133 

aaactgagag tgacg 15 



<210> 134 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 134 

gaaaataaga cacgt 15 



<210> 135 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 135 

tatgatggca ttgga .15 



<210> 136 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 136 

catttatatt catcc 15 
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<210> 137 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 137 

tatgatggca ttgta 15 



<210> 138 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 138 

catttatatt catac 15 



<210> 139 

<211> 15 

<212> DNA . 

<213> Homo sapiens 

<400> 139 

ctgtgaaaac agcgt 15 



<210> 140 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 140 

atgtgccgct gcacg 15 



<210> 141 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 141 

ctgtgaaaac agcat 15 



<21C> 142 
<211> 15 
<212> DNA 
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<213> Homo sapiens 
<400> 142 

atgtgccgct gcatg 15 



<210> 143 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 143 
gagtctaaag aagac 



<210> 144 

<211> 15 

<212> DtOA 

<213> Homo sapiens 

<400> 144 

aggaacctga tggtc . . 15 



<210> 145 

<211> 15 
.<212> DNA 
<213>: Homo sapiens 

<400> 145 
gagtctaaag aagtc 



<210> 146 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 146 
aggaacctga tggac 

<210> 147 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 147 
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15 



<210> 148 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 148 

atttctaaaa ataac 15 



<210> 149 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 149 

attgtatcag aagct 15 



<2io> isa 

<211> 15 ■ 

<212> DNA 

<213> Homo sapiens 

<400> 150 

atttctaaaa atagc 15 



<210> 151 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 151 

taaacttgaa ' 10 



<210> 152 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 152 

aatcattcat 10 
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<210> 153 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 153 

ccggatcttg ' 10 



<210> 154 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 154 

tccgatccag 10 



<210> 155 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 155 

acaccaccgc 10 



<210> 156 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 156 

tgaggggtgg 10 



<210> 157 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 157 

acctccctgg 10 



<210> 158 

<211> 10 
<212> DNA 
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<213> Homo sapiens 

<400> 158 
gaaaggatcc 

<210> 159 • 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 159 

aaagcgttaa 10 



<210> 160 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 160 

aagctccagg ^.0 



<210> 161 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 161 
gtccctgggp 



<210> 162 

<211> 10 ■ 

<212> DNA 

<213> Homo sapiens 

<400> 162 

cccgtcggct • 10 



<210> 163 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 163 
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gcggggagaa 10 



<210> 164 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 164 
cgagtggagc 



<210> 165 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 165 

tgtcctttta 10 



<210> 166 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 166 
actttgcagc 



<210> 167 

<211> 10 

<212> DNA • 

<2.13> Homo sapiens 

<400> 167 
caagctggta 

<210> 168 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 168 
acattgacac 
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<210> 169 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 169 
caaaggacca 

<210> 170 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 170 
cctctgagca 

<210> 171 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<.400> 171 
tgcaacataa 



<210> 172 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 172 
tactgctact 



<210> 173 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 173 
tggtattttc 



<210> 174 
<211> 10 
<212> DNA 
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<213> Homo sapiens 
<400> 174 

gggttcctac 10 



<210> 175 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 175 

ttgcatttta 10 



<210> 176 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 176 

ataaaatatg 10 



<210> 177 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 177 

ccaaagtaaa 10 



<210> 178 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 178 

cactctctgc 10 



<210> 179 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 179 
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agagtgtaga 10 



<210> IBO 
<211> io 
<212> DNA 

■<213> Homo sapiens 
<400> 180 

gttttatcct 10 



<210> 181 

<2H> 10 

<212> ONA 

<213> Homo sapiens 

<400> 181 

tagagaggat 10 



<210> 182 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 182 

gttgccgttt 10 



<210> 183 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 183 

tgctgaagtt 10 



<210> 184 

<211> 10 

<212> DNA 

<213> Homo sapiens 

'<400> 184 

gatgtttcca 10 
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<210> 185 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 185 

aaggtatgat 10 



<210> 186 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 186 

tattttagat 10 



<210> 187 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 187 

ggctagttac 10 



<210> 188 

<2H> 10 

<212> DNA 

<213> Homo sapiens 

<400> 188 

agttaattgc 10 



<210> 189 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 189 
ctgagagtga 



<210> 190 
<211> 10 
<212> DNA 
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<213> Homo sapiens 
<400> 190 

aataagacac 10 

<210> 191 

<211> 9 

<212> DNA 

<213> Homo sapiens 

<400> 191 

atggcattg 9 



<210> 192 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 192 

ttatattcat 10 



<210> 193 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 193 

tgaaaacagc 10 



<210> 194 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 194 

tgccgctgca 10 



<210> 195 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 195 
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tctaaagaag 



10 



<210> 196 



<211> 10 



<212> DNA 



<213> Homo sapiens 



<400> 196 



aacctgatgg 



10 



.<210> 197 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 197 

gtatcagaag 10 

<210> 198 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 198 

tctaaaaata 10 

<210> 199 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 199 

gtacggcgga aactcacagc 20 

<210> 200 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 200 

gcacagcaac ttgttcattg tgg 23 
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<210> 201 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 201 

tctcccaggg gacagacacc 20 



<210> 202 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 202 

taaaaggaca ccctagggga age 23 



<210> 203 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 203 

atcctggagc tttctgcaca cc 22 



<210> 204 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 204 

ctgaatctaa gggaccactt ctttgc 26 



<210> 205 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 205 

gctaagatga tgccactgtg ttcc 24 



<210> 206 
<211> 23 
<212> DNA 
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<213> Homo sapiens 

<400> 206 

gccctgtagt ggcaaagtat tec 23 



<210> 207 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 207 

cctttcctct cacatttcat gage 24 



<210> 208 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 208 

caaaettgac actgceettt ge 22 



<210> 209 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 209 

acacgtaagc tcagttggtc tctge 25 



<210> 210 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 210 

gagcaggggt gagaacaaaa cc 22 



<210> 211 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 211 
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cttccaaagt tttgttggat gcc 23 



<210> 212 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 212 

agtttactca tccatgggat ctcg 24 



<210> 213 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 213 

cccagcgaga ggacaaaggt 20 



<210> 214 

<2H> 19 

<212> DNA 

<213> Homo sapiens 

<400> 214 

ggaaacctca ggggcttgg . 19 



<210> 215 
<211> 21 . 
<212> DNA 

<213> Homo sapiens 
<400> 215 . 

cccagccctg aaagcgttaa t - . ' 21 



<210> 216 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 216 

aaagcggttt cctgctccag • 20 
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<210> 217 
<211> 20 
<212> ONA 

<213> Homo sapiens 
<400> 217 

ccgcccaagc ttcctaaaaa 20 



<210> 218 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 218 

tctccctctc tctcgctgtc tg 22 



<210> 219 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 219 

agtggaccac ccaggaaacg 20 



<210> 220 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 220 ■ 

gcaatggtcc tttggaagca 20 



<210> 221 
<211> 20 
<212> DNA 

<213> Homo sapiens' 
<400>-221 

aactggcaaa ggggatgatg 20 



<210> 222 
<211> 22 
<212> DNA 
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<213> Homo sapiens 
<400> 222 

gcatcgagag tagccttagc tg 22 



<210> 223 

<211> 20 

<212> DNA" 

<213> Homo sapiens 

<400> 223 

accagcqaac agaagcttga 20 



<210> 224 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 224 

gtccaacaat gattccaaca gg 22 



• <210> 225 
<211> 22 
<212> DNA 
<213> Homo sapiens 

<400> 225 

ggtgtcactt aactccctct ca 22 



<210> 226 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 226 

ctgattggac ctggttacct 20 



<210> 227 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 227 
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ctgtaaacaa tttcagtggc aacccg 26 



<210> 228 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 228 

ccgtgctatt ctgcattcac tccttg 26 



<210> 229 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 229 

cgtaggaagc tccgatacca atagcc 26 



<210> 230 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 230 

ggagagcagg ggaaaaaaaa gcc 23 



<210> 231 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 231 

ttgaggtttc agaacccgaa gtgaag 26 



<210> 232 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 232 

caacaggaag tatcgcctgc ctttg 25 
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<210> 233 

<211> 26 

<212> DNft 

<213> Homo sapiens 

<400> 233 

gttaatcctg gagctttctg cacacc 26 



<210> 234 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 234 

tcgctgtctg tctctctctt gctgtc 26 



<210> 235 

<211> 24 . 

<212> DNA. 

<213> Homo sapiens 

<400> 235 

tcatgctaag atgatgccac tgtg 24 



<210> 236 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 236 

gcaaagtatt cctctgagca atgg 24 



<210> 237 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 237 

ctgtgttaag agggcatctg ctg 23 



<210> 238 
<211> 22 
<212> DNA 
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<400> 238 

ttgaccaaga atgtggctgg ag 22 



<210> 239 

<211> 25 

<212> DNA ■ 

<213> Homo sapiens . 

<400> 239 

gccactaaga ccagccaaca gaagc 25 



<210> 240 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 240 

gagcaggggt gagaacaaaa ccttg 25 



<210> 241 

<211> 26. 

<212> DNA 

<213> Homo sapiens 

<400> 241 

ggctgtgtgt ctcctttagt tcctcg 26 



<210> 242 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 242 

ctgacagttg gattaaccat ttgggg 26 



<210> 243 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 243 
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tgggaaagtg gtacgtcttt gagtgc 26 



<210> 244 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 244 

ttgacctctg tgaaaacagc gtgc 24 

<210> 245 

<2H> 26 

<212> DNA 

<213> Homo sapiens 

<400> 245 

ctgaaagcct' caagtgcctg agaaac 26 
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